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Chapter  1 

Introduction 


In  studying  the  behavior  of  power  functions  of  various  multivariate  tests  under  al¬ 
ternative  hypotheses  it  is  essential  to  develop  a  distribution  theory  for  noncentral  distribu¬ 
tions  in  the  multivariate  case.  It  is  somewhat  surprising  that  this  is  not  straightforward. 
In  fact  even  intuitively  obvious  results  concerning  the  power  functions  of  multivariate  tests 
often  require  fairly  elaborate  arguments  because  of  this  difficulty.  For  a  recent  example  of 
this  see  Olkin  and  Perlman  (1980). 

,  Let  us  take  the  noncentral  x2  distribution  and  its  multivariate  analog,  the  non¬ 
central  Wishart  distribution,  as  an  example.  The  density  of  the  noncentral  x2  distribution 
is  usually  written  as  an  infinite  series.  This  series  arises  from  the  expansion  of  the  exponen¬ 
tial  part  of  the  normal  density  into  a  power  series  and  its  term  by  term  integration  with 
respect  to  irrelevant  variables.  In  the  multivariate  case  this  integration  becomes  nontrivial 
involving  an  integration  with  respect  to  the  Haar  invariant  measure  on  the  orthogonal 
group.  Anderson  and  Girshick  (.1944)  and  Anderson  (1946)  were  the  first  systematic  studies 
of  this  problem.  James  (1955a,  1955b)  introduced  the  integration  with  respect  to  the  Ilaar 
measure  on  the  orthogonal  group  explicitly  and  made  further  progress.  Ilerz  (1955)  devel¬ 
oped  a  theory  of  hypergeometric  functions  in  matrix  arguments  and  expressed  the  density  of 
the  noncentral  Wishart  distribution  by  a  hypergeometric  function.  Then  James  (1960,1961) 
introduced  “zonal  polynomials”  which  have  a  special  invariance  property  with  respect  to  the 
Haar  measure  and  the  density  of  the  noncentral  Wishart  distribution  can  be  expressed  in  an 
infinite  series  involving  zonal  polynomials.  This  infinite  series  provides  an  explicit  infinite 
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series  expression  for  the  hypergeometric  function  introduced  by  Ilerz.  James  (1960,1961b)', 
Constantine  (1963,  1966)  and  others  worked  out  many  other  distributions  and  expressed 
them  in  a  similar  fashion.  Thus  zonal  polynomials  provide  a  unifying  tool  for  the  study  of 
multivariate  noncentral  distributions. 

In  spite  of  this  advantage  zonal  polynomials  have  been  considered  to  be  difficult 
to  understand.  This  is  mainly  because  the  definition  of  zonal  polynomials  required  rather 
extensive  knowledge  of  group  representation  theory,  spherical  function  theory,  etc.  This 
difficulty  was  hardly  alleviated  by  more  recent  (abstract)  treatments  by  Farrell(1976)  and 
Kates(1980).  Another  reason  is  that  explicit  expressions  for  zonal  polynomials  are  not 
known.  Furthermore  an  infinite  scries  involving  zonal  polynomials  often  converges  very 
slowly.  Because  of  these  difficulties  many  nonccntral  multivariate  densities  which  can  be 
expressed  in  infinite  series  involving  zonal  polynomials  have  been  rarely  used  and  studied 
numerically. 

The  purpose  of  this  paper  is  to  present  a  self-contained  development  of  zonal 
polynomials  for  people  working  with  multivariate  analysis.  Our  primary  goal  is  to  write  a 
readable  account  of  zonal  polynomials  in  the  framework  of  standard  multivariate  analysis 
and  our  development  in  the  sequel  might  seem  sometimes  repetitive.  Furthermore  we 
pay  fairly  close  attention  to  the  numerical  aspects  of  zonal  polynomials,  their  coefficients, 
etc.  No  knowledge  of  extensive  mathematics  is  needed  but  we  assume  that  the  reader  is 
familiar  with  usual  multivariate  analysis  (e.g.  Anderson(1958))  and  linear  algebra.  As  is 
often  the  case  with  an  elementary  aproach  we  have  to  do  a  good  deal  of  computation, 
which  sometimes  becomes  tedious.  This  is  in  a  sense  a  necessary  tradeoff  in  adopting  an 
elementary  approach.  In  any  case  usual  material  in  multivariate  analysis  requires  fairly 
heavy  computation  and  the  computation  in  the  sequel  does  not  seem  too  heavy. 

Recently  there  have  been  several  elementary  treatments  of  zonal  polynomials. 
Let  us  briefly  discuss  these  approaches  and  their  relations  to  the  present  work.  Our 
starting  point  was  Saw(1977)  who  derived  many  properties  of  zonal  polynomials  using 
basic  properties  of  the  multivariate  normal  and  the  Wishart  distributions.  Unfortunately 
at  several  points  Saw(1977)  refers  to  Constantine(1963)  which  in  turn  makes  use  of  group 
representation  theory.  Therefore  Saw(I977)  is  not  entirely  self-contained.  Actually  as 
Saw(1977)  suggested,  it  turns  out  that  only  the  elementary  methods  from  Constantine(1963) 
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are  needed  to  complete  Saw’s  argument  but  it  was  not  clear.  Furthermore  it  seemed  more 
advantageous  to  define  zonal  polynomials  in  a  different  way  than  Saw  did  and  rearrange 
his  logical  steps,  especially  because  his  definition  of  zonal  polynomials  lacked  a  conceptual 
motivation.  See  Remark  3.12  on  this  point.  In  our  approach  zonal  polynomials  will  be 
defined  as  eigenfunctions  of  an  expectation  or  integral  operator.  Actually  by  considering 
the  finite  dimensional  vector  space  of  homogeneous  symmetric  polynomials  of  a  given  degree 
we  work  with  vectors  and  matrices  and  we  define  zonal  polynomials  just  as  characteristic 
vectors  of  a  certain  matrix.  This  will  be  done  in  Section  3.1. 

The  idea  of  “eigenfunctions  of  expected  value  operators”  is  investigated  in  a  more 
general  framefork  in  recent  works  by  Kushner  and  Meisner(1980)  and  Kushner,  Lebow, 
and  Meisncr(  1981).  In  the  second  paper  they  gave  a  definition  of  zonal  polynomials.  They 
follow  James’  original  idea  but  mostly  use  techniques  of  linear  algebra  and  their  approach 
is  very  helpful  for  the  understanding  of  James’  original  definition.  They  consider  the  space 
of  homogeneous  polynomials  of  (elements  of)  a  symmetric  matrix  variable  A  whereas  we 
consider  the  space  of  symmetric  homogeneous  polynomials  of  the  characteristic  roots  of  A. 
In  the  former  approach  an  extra  step  is  needed  to  define  zonal  polynomials  by  requiring  an 
orthogonal  invariance.  Therefore  our  approach  in  Section  3.1  seems  to  be  more  direct. 

Another  welcome  addition  to  our  limited  literature  of  elementary  treatment  of 
zonal  polynomials  is  provided  by  the  recent  book  by  Muirhead(1982),  in  which  zonal 
polynomials  are  defined  as  eigenfunctions  of  a  differential  operator,  or  as  solutions  to 
partial  differential  equations.  Although  Muirhead  proceeds  informally  and  many  points 
are  illustrated  rather  than  proved,  the  approach  can  be  made  precise.  We  discuss  this  in 
Section  4.5.4.  From  our  prejudiced  viewpoint  our  approach  seems  more  natural,  especially 
because  the  differential  equation  is  hard  to  motivate. 

None  of  these  treatments  seem  to  be  as  complete  as  the  present  work. 

Zonal  polynomials  were  originally  defined  using  group  representation  theory  and 
their  properties  were  derived  from  the  definition.  An  obvious  alternative  for  us  is  to  look 
at  various  properties  satisfied  by  zonal  polynomials  and  see  whether  these  properties  can 
be  used  as  a  definition.  A  definition  must  be  first  of  all  meaningful,  namely  it  should  define 
some  mathematical  object  which  exists  in  a  unique  way.  It  becomes  a  successful  one  if  all 
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other  properties  can  be  derived  from  it.  Is  our  definition  successful?  The  answer  seems 
to  be  affirmative.  Actually  we  will  derive  almost  all  the  known  major  properties  of  zonal 
polynomials  in  a  self-contained  way  as  well  as  many  new  results.  Proofs  are  often  new. 
The  new  contributions  of  the  present  work  are  listed  in  Appendix. 

A  good  measure  to  judge  our  optimistic  claim  is  a  remarkable  paper  by  James(1964). 
It  is  expository  and  contains  many  statements  which  seem  to  have  never  been  proved  in 
publication.  In  the  sequel  we  will  often  refer  to  formula  numbers  in  this  paper. 

We  will  not  discuss  various  noncentral  densities  which  arc  written  as  infinite  series 
involving  zonal  polynomials.  In  addition  to  James(1964)  we  just  refer  to  Subrahmaniam(l976) 
which  provides  extensive  references  for  various  applications  of  zonal  polynomials.  Once 
zonal  polynomials  are  defined  and  their  properties  derived  it  seems  not  hard  to  write  down 
those  densities.  We  will  not  discuss  hypergeomctric  functions  of  matrix  arguments  either. 

On  the  other  hand  we  included  a  development  of  complex  zonal  polynomials  (zonal 
polynomials  associated  with  the  complex  normal  and  the  complex  Wishart  distributions). 
One  reason  for  this  is  that  our  approach  for  the  real  case  almost  immediately  carries  over 
to  the  complex  case.  Another  reason  is  that  the  existing  theory  of  Schur  functions  provides 
explicit  expressions  for  complex  zonal  polynomials.  This  becomes  apparent  if  one  compares 
Farrell(1980)  and  Macdonald(1979).  See  Chapter  5. 

Finally  let  us  describe  subsequent  chapters.  Chapter  2  gives  preliminary  material 
on  partitions  and  homogeneous  symmetric  polynomials.  Definitions  and  notations  should 
be  checked  since  they  vary  from  book  to  book.  In  Chapter  3  we  define  zonal  polynomials 
and  derive  their  major  properties.  If  the  reader  is  not  much  interested  in  computational 
aspects  of  zonal  polynomials  the  material  covered  in  Chapter  3  should  suffice  for  usual 
applications.  Chapter  4  generalizes  and  refines  the  results  in  Chapter  3.  It  deals  largely  with 
computation,  coefficients,  etc.,  of  zonal  polynomials.  The  development  becomes  inevitably 
more  tedious.  In  Chapter  5  we  apply  our  approach  to  complex  zonal  polynomials.  We  add 
some  material  from  Macdonald(1979)  and  show  that  complex  zonal  polynomials  are  Schur 
functions. 


Chapter  2 

Preliminaries  on  partitions 
and  homogeneous  symmetric  polynomials. 


In  this  chapter  we  establish  appropriate  notations  for  partitions  and  homogeneous 
symmetric  polynomials  and  summarize  basic  facts  about  thpm.  A  large  part  of  the  material 
in  this  section  and  section  5.3  is  found  in  Macdonald(1979),  Chapter  1. 

§2.1  Partitions. 

A  set  of  positive  integers  p  =  [p lt. . . ,  pt )  is  called  a  partition  of  n  if  n=pH - \ -pt. 

To  denote  p  uniquely  we  order  the  elements  as  Pi  >  P2  >•••>  Pt-  Pi, . .  .,pt  are  called 
parts  of  p ;  l,p\,n  are 

l  =  £{p)  =  length  of  p  —  number  of  parts, 

(2  j)  Pi  =  M?)  =  height  of  p, 

n  =  |p|  —  weight  of  p. 

respectively.  The  multiplicity  mt-  of  i,  (i  =  1, 2, . . . )  in  p  is  defined  as 
(2-2)  mi  =  number  of  j  such  that  pj  =  *. 

Using  the  m,-’s  p  is  often  denoted  as  p=(lm‘2m2. . . ).  The  set  of  all  partitions  of  n  is 
denoted  by  Pn  ( ={p  :  |pj=n}). 

It  is  often  convenient  to  look  at  p  as  having  any  number  of  additional  zeros  p  = 
0,...,0).  In  this  case  it  is  understood  that  pk  =  0  for  k  >  i{p).  With  this 
convention  addition  of  two  partitions  is  defined  by  (p  +  q)i  —  Pi  +  qi,  i=  1, 2, _ 


2.  Preliminaries  on  partitions  and  homogeneous  symmetric  polynomials. 


6 


A  nice  way  of  visualizing  partitions  is  to  associate  the  following  diagrams  to  them. 
For  p  =  (pi, . . . ,  pe)  we  associate  a  diagram  which  has  p,  dots  (or  squares)  in  i-th  row.  For 
example  the  diagram  of  (4, 2, 2,1)  is  given  by 


Figure  2.1. 

We  define  the  conjugate  partition  p'  of  p  by  means  of  this  diagram,  namely  p'  is 
a  partition  whose  diagram  is  the  transpose  of  the  diagram  of  p.  From  Figure  2.1  we  see 
(4, 2, 2,1)'  =  (4, 3, 1,1).  Clearly  p''  =  (p1)'  =  p.  Furthermore  |pj  =  |p'|,  l(p)  =  h(p'),h(p)  = 
t{p').  More  explicitly  p'  is  determined  by 

(2.3)  mi(p')  =  Pi  -  Pt+i,  i  —  1, . . . ,  L 
Therefore  for  example 

l(p')  =  mx(p')  +  m2(p')  +  •  •  • 

(2.4)  =  (pi  -p2)  +  (p2  -P3)  +  "• 

=  Pi  =  h{p). 

Let  s  >  h[p),t  >  l{p).  We  define 
(2-5)  Pl.t  =  {s-pt,s-  pt_x ,  •  •  • ,  s  -  px) 

From  Figure  2.2  we  have  (4,  2,  2, 1)|  5  =  (4, 3,  2, 2,  0).  Note  that 

(2-6)  Ktl  =  s*~M 


•  •  x  x 

t  •  •  X  X 

•  XXX 
X  X  X  X 

Figure  2.2. 
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Now  we  introduce  two  orderings  in  Pn.  The  first  one  is  called  the  lexicographic 
ordering  (>).  In  this  ordering  p  is  said  to  be  higher  than  q  (p  >  q)  if 

(2-7)  Pi  =  9ij •  •  •  ,Pjfe-i  =  qk-i,Pk  >  qk  forsomek. 

This  is  a  total  ordering.  For  example  is  ordered  as  (4) > (3,1) > (2,2) >(2,1,1)>(1, 1,1,1). 
This  ordering  is  preserved  by  addition. 

Lemma  2.1.  If  pl  >  ql,p2  >  q 2  then  p1  +  p2  >  q 1  +  q2  with  equality  iff  p1  =  p2,ql  = 
<?• 

Proof:  Let  kiy  i  =  1,2,  be  the  first  index  such  that  p'k.  >  q^..  (&*•  is  defined  to  be  oo  if 
P*  —  <z‘-)  Let  k  =  min(ki,  fc2)-  Then 

Pj  +  Pj  —  q)  +  »  f°r  j  <  k, 

(2-8)  pi  +  p\  >  qi  +  q\. 

Hence  p1  +  p2  >  q1  +  q2.  Equality  holds  if  and  only  if  k  —  oo  or  pl  —  ql,p2  =  q2.  | 


Another  ordering  is  the  majorization  ordering,  p  majorizes  q  {p  >-  q)  if  and  only 
if 

(2.9)  pi  >  qly  pi  +  p2  >  qi  +  q%,  ...  , pi  +  •  •  •  +  Pk  >  qi  +  •  •  •  +  qk,... 

Note  that  for  k  >  mox(£(p),  I(q))  the  equality  holds  because  both  sides  are  equal  to  the 
weight  n.  Majorization  is  a  partial  ordering  and  it  is  stronger  than  the  lexicographic 
ordering: 

Lemma  2.2.  If  p  >-  q  then  p  >  q. 

Proof:  Suppose  pi  =  qlr  •  • ,  pk-\  —  qk-i,Pk  7^  qk •  Then  pi  +  •  •  •  +  pk  >  qi  H - +  qk 

implies  pk  >  qk.  Hence  p  >  q.  | 
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Remark  2.1.  The  converse  of  Lemma  2.2  is  false.  For  example  (3,1,1,1)  >  (2,2,2)  but 
there  is  no  majorization  between  these  two. 

Analogous  to  Lemma  2.1  we  have 

Lemma  2.3.  If  p1  >-  ql,p2  >-  q 2,  then  pl  +  p2  ql  +  q 2  with  equality  iff  pl  =  p2  ,qx  — 

92. 

Proof :  For  any  k 

{p[  +  Pi)  +  •  * '  +  (pl  +  pl)  >  (gj  +  q\ )  +  •  •  •  +  (g[  +  q\) 
with  equality  iff  p\  H - +  P%  —  Qi  H - +  q{h,  i  =  1, 2.  | 


§2.2  Homogeneous  symmetric  polynomials. 

Let  f(xlt ...,  Xk)  be  a  polynomial  in  Xi, . . . ,  x *.  /  is  homogeneous  (of  degree  n)  if 
/  has  only  n-th  degree  terms.  /  is  symmetric  if 

(2.10)  f(x  i,...,xk)  =  f{xil,...,xik), 

where  (ii,...,*fc)  is  any  permutation  of  (xi,...,xjt).  Let  V„  denote  the  set  of  all  n-th 
degree  homogeneous  symmetric  polynomials  including  the  constant  /  =  0.  We  look  at  Vn 
as  a  vector  space  where  addition  is  the  usual  addition  of  polynomials.  Let  /  £  Vn  and 
suppose  that  /  has  a  term  axi'-’-x^1  {{pi,  -  •  •  ,Pi)  £  Pn )>  then  by  symmetry  it  also  has 

a  term  ax?(  •  •  ■  where  i\,...,ii  are  distinct  integers  taken  from  (1,. . .  ,k).  Counting  all 
different  terms  we  see  that  /  can  be  written  as  a  linear  combination  of  monomial  symmetric 
functions  Mp,  p  £  Pn  , 

(2-H)  f=T,  °pMp> 

pePn 

where 

mp=  E  <■•••<• 


(2.12) 
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In  (2.12)  we  count  only  distinguishable  terms.  For  example 

(2.13)  >1(1,!)  =  ^2  XiXj. 

i<j 

Sometimes  it  is  more  convenient  to  use  augmented  monomial  symmetric  function  AMP  for 
which  the  summation  in  (2.12)  if  over  all  permutations  of  i  different  integers  from  (1,. . .  ,k). 


Therefore 

(2.14) 

=  ]> 2  x'xi  —  2-^( 

In  general 

(2.15) 

h(p) 

y?Mp  =  ( JJ  m<!)Mp. 

»= l 

where  (pi, . . 

•tPi)  —  (lm‘2ma...). 

We  note  that  in  (2.11)  the  number  of  variables  k  does  not  play  an  explicit  role. 
Actually  Mp  can  be  defined  for  any  number  of  variables  by  (2.12)  and 

(2.16)  •Mp(a:i>  •  •  •  >  0>  •  •  •  >  0)  —  Mp[xi,...,xk). 

Hence  it  suffices  to  consider  Mp  which  is  defined  for  sufficiently  large  number  of  variables. 
Now  suppose 

(2-17)  ^2  aPMp  =  0, 

p€Pn 

We  look  at  terms  of  the  form  x\l-  •  -x\l .  Differentiating  (2.17)  pi  times  with  respect  to  xit 
we  have  ap  =  0.  Hence  Mp)p  €  Pn  are  linearly  independent  in  Vn.  (Of  course 
if  k  <  i{p)  then  >lp(xi, . . . ,  Xk)  =  0  which  is  linearly  dependent  in  a  trivial  sense.  But  as 
above  we  consider  k  to  be  sufficiently  large.  For  more  detail  see  Section  4.1.)  From  (2.11) 
and  (2.17)  it  follows  that  {  .Mp,  p  £  Pn  }  forms  a  basis  of  Vn.  This  is  a  rather  obvious  basis. 
We  want  to  consider  other  bases.  The  following  lemma  is  useful  for  this  purpose. 

Lemma  2.4.  If  A.  is  an  upper  triangular  matrix  with  nonzero  diagonal  elements,  then 
A-1  has  the  same  property. 

Proof:  By  assumption  |A|  0.  Hence  A-1  exists.  Let  A-1  =  {o,J}.  Then  ai}  =  Ay,/|A|, 

where  A,y  is  a  cofactor  of  A.  By  triangularity  of  A,  Ay<  =  0  for  i  >  j.  | 
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Corollary  2.1.  If  A.  is  an  upper  triangular  matrix  with  diagonal  elements  1  and  integral 

offdiagonal  elements ,  then  A-1  has  the  same  property. 

Proof :  |A|  =1.  Furthermore  A is  an  integer  since  the  elements  of  A  are  integers.  | 

Now  we  consider  products  of  elementary  symmetric  functions.  Let 

(2.18)  Ur=  ^2  *tVX»'r 

*1<  •••«, 

be  the  r-th  elementary  symmetric  function.  For  p  E  Pn  we  define 

(2.19)  Up  =  upll-p*u?-p3---upll. 

The  degree  of  Up  is 

(2.20)  ( pi  -  p2)  +  2(p2  -  P3)  H - +  tpi  —  pi  +  •  •  •  +  Pi  =  n. 

Hence  £/p  E  Vn.  ifp  defined  by  (2.19)  corresponds  to  Up>  in  Macdonald’s  notation  (1979). 
Lemma  2.5. 

(2.21)  Up  =  Mp  +  y  '  OpgMq, 

9<P 

where  apq  are  integers. 

Proof :  Consider  monomial  terms  of  the  form  x^Xq2’  -  •  xff,  q  =  (qi, . .  .,qk)  €  Pn ■  Now 

Up  =  (xi  d - )Pl_P2(xiX2  d - )P2-P3*-*(xi"-X^d - )P1. 

Hence  the  highest  order  term  obtained  by  expanding  Up  is 

xfl_P2(xix2)Pa~P3---(xr--x^)P4  =  xPi  xPi  •  •  •  x%1, 

which  has  coefficient  1.  It  is  clear  that  other  terms  are  lower  in  the  lexicographic  ordering 
and  have  integral  coefficients.  | 
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Remark2.2.  For  a  stronger  result  see  Lemma  4-1- 

We  order  MP,UP,  p  €  Pn  according  to  the  lexicographic  ordering  and  form  two 

vectors: 


(2.22) 

M  = 

r  M(„)  ^ 
•M(n-l.l) 

,  u  = 

(  tt(n)  > 

<  ^(l-)  ; 

2i(l»)  j 

Then  Lemma  2.5  implies  that 

(2.23)  U  =  AM,.  A  =  (ap,), 

where  A  is  a  matrix  satisfying  the  condition  of  Corollary  2.1.  Therefore  considering  A-1 
=  ( apq )  we  obtain 

(2.24)  Mp  =  Up+J2aP9Ui> 

Q<P 

where  apq  are  integers.  We  see  that  {Up,p  6  Pn}  forms  another  basis  of  Vn. 

Product  of  U  functions  corresponds  to  the  addition  of  partitions. 

Lemma  2.6. 

(2.25)  UpUq  =  Up+q . 


Proof: 
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The  third  basis  of  V„  is  given  by  product  of  power  sums.  Let 


(2.26) 


tr  = 


For  p  6  Pn  we  define 


(2.27) 


Tp  defined  by  (2.27)  corresponds  to  Tp>  in  Macdonald(1979)  and  in  Saw(1977).  Here  we 
prefer  the  above  definition  because  of  the  simpler  relation  between  Up  and  Tp. 

Let 


(2.28) 


m  =  if1  +  SX{ )  =  1  +  UiS  +  U2S2  +  . . . 


be  a  generating  function  of  u's.  Then 


log  U{s)  =  J3log(l  +  s«») 


(2.29) 


sfj - —  $2  +  •  •  •  +  (—  l)r  1  — tr  + 

u  T 


On  the  other  hand 


(2.30)  logf/(s)  =  (uis  4-  U2S2  H - )  —  ^(«is  +  u2s2  +  •  ••  )2  + 

Lt 

Comparing  coefficients  of  sr  in  (2.29)  and  (2.30)  we  see 


(2.31) 


tr  =  (-l)r  1f{«r+ 

g>(l').9GPr 

=  (-l  Y~1r{U(lr)+  »rqUg). 

«>(i  r),«e/V 


It  is  straightforward  to  show  that 


_(-l)f»-i/ 

a  —  - 

9i  V9 


9i 


<9i  “  92,92  —  93, 

(-i^-Hgi  - 1)! 

(9i  —  92)!*  *  *  9£(«)J 


■•>9*(g)) 


(2.32) 
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IS 


Now 


(2.33) 


t(p) 

tp  =  n 

r  =  l 


t(p) 


=  n 

r  =  l 


(-l)r  ir  {Zi( ir)  +  ar,2i,} 

9>(lr),96^r 


Pr~ Pr  +  l 


By  Lemma  2.1  and  Lemma  2.6  the  lowest  order  term  in  (2.33)  is  given  by 


(2.34) 


Hence 

Lemma  2.7. 

(2.35) 

where 

(2.36)  . 

Let 


n  [(-ir  ^(k)] 


Pr  Pr  +  l 


r=l 


=  n  [(-ir-'rr  pr+iup1'-p*u?-p3-..upli 


r  —  1 


=  (_1)|P|-Pl(jjrp.-P,  +  I^p. 


r=l 


Tp  =  E  «rA. 

9>P 


l{p) 


app  =  (-ljW-Pi  JJ  rP--P'+i  ^  o. 


r  =  l 


f  T(n)  \ 


T= 


Then  Lemma  2.7  shows  that 
(2.37) 


\  V)  ) 


r=m, 


where  Fis  lower  triangular  with  nonzero  diagonal  elements.  Hence  {  Tp,p  € 
basis  of  Vn. 


Pn  }  forms  a 
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Remark  2.3.  To  show  that  {  Tp,p  €  Pn  }  «  a  Aasia  #f  ts  mucA  easier  to  note 

Tpi  —  AMp  +  )  ]  ClpqAMq, 

9>P 

) 

where  apq  are  integers  and  use  (Corollary  2.1.  But  we  will  use  Lemma  2.1  in  Section  4-6. 

We  study  symmetric  functions  further  in  Section  5.3.  However  the  material 
covered  so  far  suffices  to  derive  zonal  polynomials  which  form  another  basis  of  Vn. 


Chapter  8 

Derivation  and  some  basic  properties  of 

zonal  polynomials 


In  this  chapter  we  define  (real)  zonal  polynomials  and  derive  some  properties. 
Some  remarks  on  notation  seem  appropriate  here.  We  define  zonal  polynomials  as  charac¬ 
teristic  vectors  of  a  certain  linear  transformation  r  from  Vn  to  Vn.  The  normalization 
is  rather  arbitrary  for  a  characteristic  vector  and  many  properties  of  zonal  polynomials 
are  independent  of  particular  normalization.  Corresponding  to  different  normalizations, 
different  symbols  such  as  ZP,CP  have  been  used  to  denote  zonal  polynomials.  We  find  it 
advantageous  to  use  still  another  normalization  in  addition  to  those  corresponding  to  ZP,CP. 
Considering  these  circumstances  we  use  t/p  for  an  unnormalized  zonal  polynomial,  t  l/p  is 
used  to  denote  a  zonal  polynomial  normalized  so  that  the  coefficient  of  Up  or  Mp  is  1. 

§3.1  Definition  of  zonal  polynomials 

As  mentioned  earlier  we  define  zonal  polynomials  as  characteristic  vectors  of  a 
certain  matrix.  The  matrix  in  question  will  be  triangular  and  we  begin  by  a  lemma 
concerning  a  triangular  matrix  and  its  characteristic  vectors. 

Lemma  3.1.  Let  T  =  (<tJ)  be  an  n  X  n  upper  triangular  matrix  with  distinct  diagonal 
elements.  Let  A=diag(£n, . . . ,  tnn).  Then  there  exists  an  upper  triangular  matrix  3 
satisfying 


(3.1) 


sr=  as. 
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S  is  uniquely  determined  up  to  a  (possibly  different)  multiplicative  constant  for  each  row. 


Proof :  The  diagonal  elements  of  T  are  the  characteristic  roots.  (3.1)  shows  that  the  i-th 
row  £(•  of  B  is  a  characteristic  vector  (from  the  left)  associated  with  tn.  Let 


(3.2) 


y'T=  {pi 


■<:  a 


tuv'- 


Given  yi  =  c  (3.2)  becomes 


(3.3)  {ti  i  c,  ctj  +  y2T2)  =  (*ii  c,tny2). 

Therefore 

(3.4)  y2{T2  -  tnl)  =  -ct\. 

By  assumption  T2  —  tul  is  an  upper  triangular  matrix  with  nonzero  diagonal  elements. 
Hence  y2  is  uniquely  determined  as 


(3.5)  y2  =  -ct\[T2  -  tul)-1  (c  ^  0). 
Now  for  i  >  1  let 

(3.6)  y'T—  tay' . 


Then  considering  first  i  —  1  elements  we  have 


(3.7) 


2/i*ii  =  ***2/i 

2/1*12  -1-  2/2*22  =  tay2 


3/l*l,»'-l  +  *  •  •  +  2/*— 1** — 1,» — 1  —  UiVi-l • 


Inductively  solving  this  we  obtain  0=yi=-  •  *=2/* — l *  Then  (3.6)  reduces  to  an  equation 
like  (3.2)  with  a  smaller  dimensionality.  Hence  arguing  as  above  we  see  that  given  yi  =  c, 
(yi+i, . . . ,  yn)  is  uniquely  determined.  fl 
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Remark  3.1.  This  lemma  seems  to  be  well  known  to  people  in  numerical  analysis 
although  an  explicit  reference  is  not  easy  to  find.  It  is  very  briefly  mentioned  on  page  365 
of  Stewart(1973)  in  connection  with  the  QR  algorithm.  The  QR  algorithm  is  designed  to 
transform  a  general  matrix  to  a  triangular  form  in  order  to  obtain  the  characteristic  roots 
and  vectors. 


For  a  k  X  k  matrix  A  =  (atJ)  we  denote  its  characteristic  roots  by 


(3.8) 


ot  =  (aj, . . . ,  afc)  =  \(A), 


and  (the  determinant  of)  a  principal  minor  by 
(3.9)  A(il}...,it)  = 

For  a  matrix  argument  we  define 


a*it'z 


aiiit 


UP(A)  =  Up{«)  =  UP(\(A)). 


As  is  easily  seen  by  expanding  the  determinant  | A  —  XJ]  the  r-th  elementary  symmetric 
function  of  the  roots  of  a  matrix  A  is  equal  to  the  sum  of  r  X  r  principal  minors, namely 

(3.10)  Z/(i')(A)  =  ur  (ai,...,  ak)  =  ^2  A{iu...,ir). 

tt  <•••<», 

(See  Theorem  7.1.2  of  Mirsky(1955)  for  example.)  Hence 

(3.11)  um)  =  E 

»1  »’l<t3 

Accordingly  V„(A)  denotes  the  vector  space  of  n-th  degree  homogeneous  symmetric  poly¬ 
nomials  in  the  roots  of  A. 


Now  consider  a  (linear)  transformation  tv  :  Vn(A)  — >  V„(A)  defined  by 


(3-12)  (t„(Up))(A)  =  MP)(A)  =  £w{Up(AW}}, 

where  A  is  a  symmetric  matrix  and  W  is  a  random  symmetric  matrix  having  a  Wishart 
distribution  W(Iky^}.  Here  W{S,  v)  denotes  the  Wishart  distribution  with  covariance  £ 
and  degrees  of  freedom  u.  (t„  is  defined  for  the  basis  {Up}  by  (3.12)  and  for  general  elements 
of  Vn(A)  tu  is  given  by  the  linearity  of  expectation.)  First  we  need  to  verify: 
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Lemma  3.2.  If  A  is  symmetrix,  then  [tuIIp){A)  €  Vn(A). 

Proof :  Since  A  is  symmetric  it  can  be  written  as  A  —  rDT1  where  J1  is  orthogonal  and 
D  =diag afc).  Now  UP(AW)=  Up{rDr'W)=  Up{Dr'WI)  because  the  nonzero 
roots  are  invariant  when  the  matrices  are  permuted  cyclically.  Since  the  distribution  of 
r'WT is  the  same  as  the  distribution  of  W,  we  can  take  A  =  diag(«i, . . . ,  ak )  without  loss 
of  generality.  Then 

(3.13)  AW(»i,...,*r)  =  («»-,  •  •  •  a*r)W(ii> . . . ,  ir)« 

For  example 

aitou  a  it«i2  toil  w\2 

(3.14)  AW[  1,2)=  =a^a  2 

a2«l21  Q!2t022  w2t  w22 

From  (3.10)  and  (3.13)  the  r-th  elementary  symmetric  function  of  the  characteristic  roots 
of  AW  can  be  written  as 

(3.15)  nr(X(AW))=  aii---airW{iu'--,ir)- 

»!<...«, 

Substituting  this  into  (3.11)  and  taking  the  expectation  we  obtain 

(3.16)  (r^p)(A)  =  fw(E«nW(ii))Pl"Pa(  ]C  W(*i. *2))P2_P3- •  • . 

»i  »i<»a 

Clearly  this  belongs  to  V„(A).  I 

t„  has  the  following  triangular  property. 

Corollary  3.1. 

(3.17)  (tuIIp){A)  =  \upUp{A)  +  ^2  apqUq{A). 

q<p 

Proof:  As  in  the  proof  of  Lemma  2.5  the  highest  monomial  term  in  (3.16)  is  of  the  form 

(3.18)  <?w{W(l)Pl-p*W(l,  if-**- •  •  W(l, . . .,  if'}. 

Then  using  (2.24)  we  see  that  {rvUp)[A)  expressed  as  a  linear  combination  of  Uq  s  involves 
only  q’s  such  that  q  <  p.  In  particular  the  leading  coefficient  is 

(3.19)  X„p  =  W(l,  2)P3~P3*  •  •  W(  1, . . . ,  £)p‘}.- 


I 
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Remark  3.2.  The  constants  apq  in  (3.17)  depend  on  the  degrees  of  freedom  v. 

Remark  3.3.  To  be  complete  we  have  to  verify  that  (3.12)  does  not  depend  on  the 

number  of  variables  k  or  more  precisely  we  need  to  verify 

(3.20)  (T1/i/p)(a1, . . .  ,ak,  0, . . .  ,0)  =  {rvUp){au  ak), 

for  any  number  (m)  of  additional  zeros.  Note  that  the  left  hand  side  is  defined  using 
expectation  with  respect  to  'W(Ik^.m,v).  Now  recall  that  the  marginal  distribution  of  the 
k  X  k  upper  left  hand  corner  of  yi)(Ik+m,v )  is  W(Ik,u )  and  (3.16)  depends  only  on  the 
k  X  k  upper  left  hand  corner  of  the  Wishart  matrix.  Therefore  (3.20)  holds. 

By  Corollary  3.1  t„  expressed  in  an  appropriate  matrix  form  is  an  upper  triangular 
matrix.  In  order  to  apply  Lemma  3.1  we  want  to  evaluate  the  “diagonal  elements”  X„p  in 
(3.19).  For  that  purpose  we  use  the  following  well  known  result. 


Lemma  3.3.  Let  W  be  distributed  according  to  W(Ik,i').  Let  T  —  (ftJ)  be  a  lower 
triangular  matrix  with  nonnegative  diagonal  elements  such  that  W  =  7T'.  Then  tij,  i  > 
j,  are  independently  distributed  as  tij  ~  M(0, 1),  i  >  j,ta  ~  x(i/  —  t'+l)  where  x(i/  — z  +  l) 
denotes  the  chi- distribution  with  u  —  i  +  1  degrees  of  freedom. 

Proof:  This  is  implicit  in  the  derivation  of  Wishart  distribution  in  Anderson  (1958), Chapter 
7,  or  see  Srivastava  and  Khatri(1979),  Corollary  3.2.4.  | 


Corollary  3.2. 


(3.21) 


=  2"  n  ri« +:(«'+!-  ili/nh-' + 1  -  oi 

-*■6(^0 

1=1  v  'Pi 

=  v(y  +  2)*  •  •  (i/  +  2  (pi  —  1)) 

•{u  -  l)(v  +  1).  •  •  (v  -  1  +  2(p2  -  1)) 

•  {»  -  £  + 1).  •  •  (u  -  l  +  1  +  2{pt  -  1)), 
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where  £  =  £{p)  and  (a)*  =  a(a  +  !)•  •  •  (a  +  k  —  1). 


Proof :  Note 

(3.22)  W[lf...,r)=*(in---frr)2. 

Substituting  this  into  (3.19)  we  obtain 

(3.23)  X„p  =  £  On1  £22*' '  ‘  tel1}- 

Now  £ft  is  distributed  according  to  x2(I/  —  *  +  1)  and  £t\fl  —  [v  —  i  +  l)(v  —  i  +  3)-  •  • 
(v  —  i  +  1  +  2 (pi  —  1)).  From  this  we  obtain  (3.21).  | 


This  proof  is  given  in  Constantine(1963)  in  a  slightly  different  form. 


Using  the  vector  notation  introduced  in  (2.22)  let 

f  Tu{U(n))  \ 


(3.24) 


t„(U)  = 


Ti/{]d[n— 1,1)) 


Then  Corollary  3.1  shows  that 
(3.25) 


V  J 

ru(U)  =  TM, 


where  Tu  is  an  upper  triangular  matrix  with  diagonal  elements  tpp  —  \„p.  Tv  almost  fits 
the  condition  of  Lemma  3.1.  The  question  now  is  what  v  to  take.  Actually  a  particular 
choice  of  v  does  not  matter;  we  have: 


Lemma  3.4.  There  exists  an  upper  triangular  matrix  B  such  that 
(3.26)  BTV  =  Av  3,  for  all  v, 

where  Au  ==  diag(X1/p,p  €  Pn).  B  is  uniquely  determined  up  to  a  ( possibly  different) 
multiplicative  constant  for  each  row. 

Lemma  3.4  shows  that  Tu  has  the  same  set  of  characteristic  vectors  (from  the  left) 
for  all  v.  A  proof  of  this  will  be  given  later  in  this  section,  Now  we  define  zonal  polynomials 
using  this  3. 
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Definition  3.1.  ( zonal  polynomials) 

Let  B  be  as  in  Lemma  3.4.  Zonal  polynomials  t/p,  p  €  Pn,  are  defined  by 

f  A 

V  (n— 1,1) 


(3.27) 


y  = 


y<.  i”) 


BU. 


Remark  3.4.  B  is  upper  triangular  and  therefore  ]jp  is  a  linear  combination  of  Uq ’s 
(or  f\q’s)  for  which  q  <  p.  It  follows  that  {  yp,P  €  Pn  }  forms  a  basis  ofVn. 


Remark  3.5.  Since  each  row  of  B  is  determined  uniquely  up  to  a  multiplicatie  con¬ 
stant  yp  is  determined  up  to  normalization.  We  use  l/p  to  denote  an  unnormalized  zonal 
polynomial. 

In  order  to  prove  Lemma  3.4  we  first  establish  that  the  2V s  commute  with  each 

other. 

Lemma  3.5. 

(3.28)  ,  T„TM  =  Tm2V. 


Proof :  For  a  symmetric  positive  semi-definite  matrix  A  let  A$  be  the  symmetric  positive 
semi-definite  square  root,  i.e.,  =  rD%r'  where  r  is  orthogonal  and  D  is  diagonal  in 

A  =  rDT'.  Now  let  W,Vbe  independently  distributed  according  to  ’W(Ik,i>),  1 
respectively.  Consider 

(3.29)  £w,v{U{A*VA?W)}, 

where  U  . .  . ,  U^))'.  Taking  expectation  with  respect  to  W first  we  obtain 

£w,v{U{A?VA*W)} 

=  £v{TJl{A*VAt)} 

(3.30)  =£v{TM(AV)} 

=  T„£v{U{AV)} 

=  TuT^U[A). 
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We  used  the  cyclic  permutation  of  the  matrices  since  nonzero  characteristic  roots  are 
invariant.  Similarly  taking  expectation  with  respect  to  V  first  we  obtain 

(3.31)  £w<v{U{aWa±W)}  =  TpTuU{A). 

Hence  TuTpU{A)  =  TpTM{A)  for  any  symmetric  A.  Therefore  T„Tp  =  TyLTv.  | 

See  Theorem  2.2  of  Kushner,  Lebow,  and  Meisner(1981)  for  an  analogous  result 
in  a  more  general  framework. 

Now  we  give  a  proof  of  Lemma  3.4. 

Proof  of  Lemma  3.4:  Consider  \up  given  by  (3.21).  Let  us  look  at  X„p  as  a  polynomial  in 
v.  They  are  different  polynomials  since  they  have  different  sets  of  roots.  Now  two  different 
polynomials  can  match  only  finite  number  of  times.  It  follows  that  for  sufficiently  large  u0, 
X„0 p,  p  €  Pn,  are  all  different.  Let  uq  be  fixed  such  that  X„oP,  p  €  Pn  are  all  different.  Let  S 
be  the  matrix  in  (3.1)  with  T  replaced  by  TUo.  Note  that  the  uniqueness  part  of  Lemma  3.4 
is  already  established  now.  Let  A  —  diag(X1/oP,  p  €  Pn )•  Then  for  any  p  A(STfi)—  ( AS)TP 
=  {ST„0)Tp  =  S{TU0Tp)  =  S(T „T„0)  =  {BTp)TUQ,  or  ABy  =' 3 xTUo  where  3t  =  BTp. 
Now  by  the  uniqueness  part  of  Lemma  3.1  we  have  Si  =  DB  for  some  diagonal  D  or 
BTp  —  DB.  Considering  the  diagonal  elements  we  see  that  D  =  Ap  =  diag(Xpp,p  E  Pn). 
Therefore  BTp  —  Ap3  for  all  p.  | 

We  defined  zonal  polynomials  by  defining  their  coefficients.  From  a  little  bit  more 
abstract  viewpoint  they  are  eigenfunctions  of  the  linear  operator  tv  and  the  results  in  this 
section  can  be  summarized  as  follows. 

Theorem  3.1.  Let  l/p  be  a  zonal  polynomial  then 

(3.32)  (r,i/p)(A)  =  £wyp(AW)  =  X,pt/p(A), 

where  W  ~  'W[Ik,v))  A  is  symmetric  and  X„p  is  given  in  (3.21).  Conversely  (3.32)  (for 
all  sufficiently  large  v)  implies  that  l/p  is  a  zonal  polynomial. 
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Proof :  y  =  (l/(n), ]/(„_ . . ,  y (!«))'  —  SU.  Hence  by  Lemma  3.4 


(3.33) 


£w{y{AW)}  =  £w{SU{AW)} 
=  B£w{U(AW)} 
=  BTuU{A) 

=  AVSU{A) 

=  A„y{A). 


Therefore  (3.32)  holds.  Conversely  suppose  that  (3.32)  holds.  Let  l/p  =  Ylq^p  aqUq-  Then 
(3.32)  implies 

CL  Ty  -  Xypfl  ) 

where  a'  =  (d(„), . . . ,  a^.,)).  Now  by  the  uniqueness  part  of  Lemma  3.4  a'  coincides  with 
the  “ p-th row  of  H  up  to  a  multiplicative  constant.  Therefore  t/p  is  a  zonal  polynomial. 

1 

Corollary  3.3. 

(3.34)  ewyp[AW)  =  \vpyp{AE), 
where  W  is  distributed  according  to  W(E,u). 

Proof :  let  W=  E%W\E%  then  Wi  is  distributed  according  to  Therefore 

£wyp{AW)  =  £wyp{A2l*WiZ?) 

(3.35)  =  £wyp{I$Al$Wi)  =  \vpyp{E*AE*)  =  \vpyp{AE). 


1 


The  converse  part  of  Theorem  3.1  will  be  used  frequently  to  show  that  a  particular 
symmetric  polynomial  is  a  zonal  polynomial.  See  Sec.  3.3,  Sec.  4.4,  and  Sec.  4.7. 

§3.2  Integral  identities  involving  zonal  polynomials 

In  addition  to  (3.32)  the  zonal  polynomials  satisfy  other  integral  identities.  The 
fundamental  one  (Theorem  3.2)  is  related  to  the  uniform  distribution  of  orthogonal  matrices. 


3.  Derivation  and  some  basic  properties  of  zonal  polynomials 


24 

The  idea  of  “averaging  with  respect  to  the  uniform  distribution  of  orthogonal  matrices”  or 
“averaging  over  orthogonal  group”  was  a  very  important  idea  of  James  for  the  motivation 
of  introducing  the  zonal  polynomials.  Therefore  it  seems  worthwhile  here  to  develop  the 
uniform  distribution  of  orthogonal  matrices  in  a  constructive  way,  although  a  general  theory 
of  Haar  measures  readily  establishes  it  (see  Nachbin(1965),  Halmos(1974)  for  example). 

Definition  3.2.  A  random  orthogonal  matrix  H  is  said  to  have  the  Haar  invariant 
distribution  or  the  uniform  distribution  if  the  distribution  of  HP  is  the  same  for  every 
orthogonal  J1. 

See  Anderson(1958), Chapter  13.  More  formally 

Definition  3.2’.  A  probability  measure  P  on  the  Borel  field  of  orthogonal  matrices  is 
Haar  invariant  if 

(3.36)  P{A)  =  P{AI) 

for  every  orthogonal  r  and  every  Borel  set  A. 

We  now  examine  existence  and  uniqueness  of  Haar  invariant  distribution.  We 
settle  the  uniqueness  question  first. 

Lemma  3.6.  Let  two  probability  measures  Pi,P2  satisfy  (3.86).  Then  P\[A)  =  P2 (A) 
for  every  Borel  set  A.  Furthermore  Pi(A)  —  P\{A ')  where  A'  =  {H'  \  H  <E  A}. 

Proof :  Let  Hi,H2  be  independently  distributed  according  to  Pi,P2  respectively.  Then 

(3.37)  Pr{H\H?2  €  A)  =  £H,  {Pr{HxH'2  €  A  |  H2)}  =  £Hi {P1{A)}  =  Pi{A). 
Similarly 

(3.38)  Pr(iT1H/2  G  A)  =  Pr(JT2JT1  <E  M)  =  £Hl  {Pr{H2H\  €  A'  \  Hv)}  =  P2{A>). 
Hence 

(3.39)  Pt{A)  =  P2{A'), 

Putting  Pi  =  P2  we  obtain  Pi  (A)  =  Pi(A'),P2[A)  =  P2(j 1').  Substituting  this  into  (3.39) 
we  obtain  Pi{A)  =  P2{A).  | 
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Remark  3.6.  For  a  more  rigorous  proof  (S.S7)  and  (3.38)  have  to  be  converted  to  the 
form  of  Fubini’s  theorem,  as  is  done  in  standard  proofs  (see  Section  60  of  Halmos(197\)). 
However  in  the  latter  form  the  change  of  variables  and  the  change  of  order  of  integration 
seems  to  be  harder  to  understand  intuitively.  The  same  remark  applies  to  the  proof  of 
Lemma  3.8  below. 

Remark  3.7.  If  H  has  the  uniform  distribution  then  for  every  orthogonal  H\  ,  HXH 
has  the  uniform  distribution.  This  follows  from  the  fact  that  HXH  and  HXHR  have  the 
same  distribution  for  every  orthogonal  R  The  second  assertion  of  Lemma  3.6  shows  that 
if  H  is  uniform  then  H'  is  uniform. 

Existence  can  be  very  explicitly  established  as  follows. 

Lemma  3.7.  (Saw(1970))  Let  U  =  (uty)  be  a  k  X  k  matrix  such  that  are  inde¬ 

pendent  standard  normal  variables.  Then  with  probability  1,  U  can  be  uniquely  expressed 
as 

(3.40)  U=TH, 

where  T  —  (tio)  is  lower  triangular  with  positive  diagonal  elements  and  H  is  orthogonal. 
Furthermore  (i)  T  and  H  are  independent,  (ii)  H  is  uniform,  (Hi)  ft-y  are  all  independent 
and  ta  ~  x(£  -  i  +  1),  <fy  ~  M{ 0, 1),  i  >  j. 

Proof:  U  is  nonsingular  with  probability  1.  Therefore  suppose  |L/j  ^  0.  Now  performing 
the  Gram-Schmidt  orthogonalization  to  the  rows  of  U  starting  from  the  first  row  we  obtain 
SU  =  H  where  S  is  lower  triangular  with  positive  diagonal  elements  and  H  is  orghogonal. 
Letting  T—S  1  we  obtain  (3.40).  Since  (3.40)  corresponds  to  the  uniquely  defined  Gram- 
Schmidt  orthogonalization  T,  H  are  unique.  Now  W  =  UlF  =  TT'  is  distributed  according 
to  W(Ifc,  k).  Hence  (iii)  follows  from  Lemma  3.3.  To  show  (i)  and  (ii)  we  first  note  that 
for  any  orthogonal  r,  UT  has  the  same  distibution  as  U.  Furthermore  UT  =  T[HJT). 
Therefore  Hr  is  the  resulting  orthogonal  matrix  obtained  by  performing  Gram-Schmidt 
orthogonalization  to  the  rows  of  [/T  and  T  is  common  to  U  and  UR  This  implies  that 
given  Tthe  conditional  distributions  of  H  and  HR  are  the  same.  Therefore  the  conditional 
distribution  of  H  given  T  is  uniform.  Now  by  unconditioning  we  see  that  T  and  H  are 
independent  and  H  has  the  uniform  distribution.  | 
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Now  we  prove  the  following  fundamental  identity  (James  (1961)).  The  proof  is 
only  a  slight  modification  of  one  in  Saw(1977). 

Theorem  3.2.  Let  A,B  be  k  X  k  symmetric  matrices.  Then 

(3.41)  £Hyp(AHBH')  =  yP(A)yp(B)/yp(ik), 

where  k  X  k  orthogonal  H  has  the  uniform  distribution. 

Proof:  Let  X(A)  =  oc  =  (ati, . . . ,  ak)  and  \(B)  =  (3  =  (pi  ,...,Pk).  Let  A  — 
HiDiH'i,  B  H2D2H'2,  where  Hi,  H2  are  orthogonal  and  Z>i  —  diag(on, . . . ,  a^), 
D2  =  diag(/?i, . . . ,  /3fc).  Now 

yp[AHBH!)  =  yp{H1DlH'lHH2D2Hi2H') 

(3.42)  =  y^DiHzD^), 

where  H2  =  H\HH2  which  has  the  uniform  distribution  (see  Remark  3.7).  Therefore 

(3.43)  £Hyp(AHBlT)  =  Suy^DiHD^). 

This  depends  only  on  a  =  (ai, . . . ,  a*),  and  P  =  [Pi, . . . ,  pk).  Now  for  any  permuta¬ 
tion  matrix  P  ,  A  =  (HXP)  (PD1P)  (. PH i).  Noting  that  a  permutation  matrix  is  or¬ 
thogonal  we  get  SHyp  (DxHD2Hr)  =  S^y  p(P  D  XPHD2H').  therefore  (3.43)  is  symmetric 
in  ai,... ,  ak.  Similarly  it  is  symmetric  in  Pi,. ..,  Pk.  Furthermore  given  p  =  (Pi, . . .  ,Pk), 
UP(DXHD2H')  is  a  homogeneous  polynomial  of  degree  n  in  (ai, ... ,  oik)  (see(3.15)).  Therefore 
taking  expectation 

(3.44)  £Hyp(DiHD2H!)  &  Vn(A)  for  each  P  =  (pu ...,  pk), 
and  we  can  write 

(3.45)  £Hyp(DiHD*Bf)  =  £  cq(p)yq(cx). 

q  ePn 

Now  looking  at  (3.45)  as  a  function  of  P  (for  fixed  ex)  we  see  that  cq(P)  (E  Vn(B).  Hence 

cq[P)  — 

q'e  Pn 


(3.46) 
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Substituting  (3.46)  into  (3.45)  and  returning  to  symmetric  A,  B  we  obtain 

(3.47)  £Hyp{AHBH')  =  £  cqq,y  q{A)y  q,{B). 

Note  that  cqqi  =  cq>q  because  y P[AHBH')  =  yp(BH'AH)  and  H'  has  the  uniform 
distribution.  Now  let  A  be  distributed  according  to  W(B,i/o)  where  uq  is  such  that  X^oP, 
p  £  P  are  all  different  (see  the  proof  of  Lemma  3.4).  Then  by  Corollary  3.3 

(3.48)  £A£Hyp{AHBH')  =  £  cqq.\„oqyq{E)yq,{B). 

9,9' 

On  the  other  hand  taking  expectation  with  respect  to  A  first  we  obtain 

£h  &Ay  p[AHBH!)  =  \t/oP£Hyp(BHBHf) 

(3-49)  =KaP'£cqq,yq(s)yq,(B). 

9.9' 

Therefore 

(3.50)  0  =  x;(\„0  p  -  \voq)cqq,yq{E)yq,{B). 

9,9' 

This  holds  for  any  B  and  B.  Hence  (X„oP  —  X„0,)c9,/  =  0  for  all  q,  q' .  Since  \Uop  7^  X^, 
for  p  7^  q  we  have  cqq>  =  0  for  all  q'  and  all  q  7^  p.  But  cqq>  =  cqiq.  Therefore  cqqi  =  0 
unless  q  —  q'  =  p.  Therefore 

(3.51)  £Hyp{AHBH>)  =  cppyp{A)yp[B). 

Putting  B  =  Ik  we  obtain 

(3.52)  yp(A)  =  Cppyp{ik)yp{A). 

Hence  cppyp[Ik )  =  1  and  this  proves  the  theorem.  | 

For  more  about  this  proof  see  Section  4.1. 

Theorem  3.2  implies  the  following  rather  strong  result.  - 
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Theorem  3.3.  Suppose  that  a  kXk  random  symmetric  matrix  V has  such  a  distribution 
that  for  every  orthogonal  j T,  PVT'  has  the  same  distribution  as  V.  Then  for  symmetric  A 

(3.53)  £vyp(AV)=cpyp(A), 
where 

(3.54)  cp  =  £v{yp(V)}/yp(Ik). 

Proof:  As  in  the  proof  of  Lemma  3.2  £yyp[AV)  £  V„(A).  Now  since  the  distribution  of 
PVT1  is  the  same  as  V we  have 

(3.55)  £vyp[ATVr')  =  £vyp{AV). 

Letting  T*  be  uniformly  distributed 

£vyp{AV)  =  £v  £vyp{Arvr') 

=  £v£ryp(ATVT') 

(3.56)  =  £v{yP(A)yP(v)/yp(ik)} 

=  yP(A)£v{yP(v)}/yP(ik). 

■ 

Remark  3.8.  In  the  sequel  we  call  the  distribution  ofV  “orthogonally  invariant”  if  it 
satisfies  the  condition  of  Theorem  3.3. 

Although  Therem  3.3  has  not  been  explicitly  stated,  it  has  been  implicitly  used 
for  several  cases;  first  with  the  multivariate  beta  distribution  by  Constantine(1963),  later 
with  the  inverted  Wishart  distribution  by  Khatri(1966)  etc.  These  cases  will  be  examined 
in  Section  4.3  together  with  the  evaluation  of  cp  for  each  case. 

We  note  that  Theorem  3.3  is  a  generalization  of  Theorem  3.1.  If  we  examine 
the  proof  of  Theorem  3.2  closely  we  find  that  we  can  take  A  =  in  (3.48)  and 

(3.49)  where  V  has  any  orthogonally  invariant  distribution  provided  that  cp,  p  £  Pn,  are 
all  distinct  for  that  distribution.  Furthermore  the  construction  of  zonal  polynomials  in 
Section  3.1  works  with  any  such  distribution.  Although  the  Wishart  distribution  seems 
to  be  a  natural  candidate,  we  could  have  used  any  such  distribution  from  a  purely  logical 
point  of  view. 

Orthogonally  invariant  distributions  are  characterized  as  follows. 
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Lemma  3.8.  Let  V  =  HDH'  where  H  is  orthogonal  and  D  is  diagonal.  Let  H  and  D 
be  independently  distributed  such  that  H  has  the  uniform  distribution.  (Diagonal  elements 
of  D  can  have  any  distribution .)  Then  V  has  an  orthogonally  invariant  distribution. 
Conversely  all  orthogonally  invariant  distributions  can  be  obtained  in  this  way. 

Proof:  The  first  part  of  the  lemma  is  obvious.  To  prove  the  converse  suppose  that  Vhas  an 
orthogonally  invariant  distribution.  Now  we  form  a  new  random  matrix  V  =  HVH*  where 
H  has  the  uniform  distribution  independently  of  V.  Then  V"has  the  same  distribution  as 
V'because  for  any  Borel  set  A 

Pr{V£  A)  =  ^{PriHVH1  £  A  |  H)} 

(3-57)  =  £h{Pt{V£  >1)}  =  Pr{V£  A). 

Now  we  evaluate  Pr(V £  A)  by  conditioning  on  V.  For  fixed  V" we  can  write  V  —  TDr' 
where  T1  is  orthogonal  and  D  =  diag(c?i, . . . ,  dk).  We  require  di  >  •  •  •  >  dk  then  D  = 
D(V)  is  unique.  Then 

Pr{V£  A\V)  =  Pr{HrD{V)r'ir  £A\V) 

(3-58)  =  Pr{HD{V)Hl  £  A  |  V). 

Note  that  we  replaced  Hr  by  H  since  Hr  has  the  uniform  distribution.  Hence 

Pr{V£  A)  =  £v{Pr{V£  A  [  V)} 

(3.59)  =  £v  { Pr(HD(V )Hf  £  A  \V)} 

=  Pr{HD[V )fT  £  A). 


This  proves  the  lemma.  | 

Remark  3.9.  Note  that  the  set  of  orthogonally  invariant  distributions  is  convex  with 
respect  to  taking  mixture  of  distributions.  Lemma  3.8  implies  that  the  extreme  points  of 
this  convex  set  are  given  by  those  distributions  for  which  D  is  degenerate. 

We  can  replace  H  in  Theorem  3.2  by  U  whose  elements  are  independent  normal 


variables. 


so 
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Theorem  3.4.  Let  U  =  ( Uij )  be  a  kXk  matrix  such  that  Uij  are  independent  standard 
normal  variables.  Then  for  symmetric  A,B 

(3.60)  £uyp(AUBUt)  =  J^yp(A)yp(B). 

Proof:  By  Lemma  3.7  U  =  TH.  Then 

£T£Hyp{ATHBH’T') 

£T£Hyp{T'ATHBH ') 

£Typ(T'AT)yp(B)/yp(ik) 
£Typ(ATT')yp(B)/yp(ik) 

We  used  the  fact  that  TT'  =  UV  ~  k).  | 

Theorem  3.4  leads  to  the  following  important  observation. 


hy^AUBU1)  = 


(3.61) 


Theorem  3.5.  bp  —  \kp/yp{Ik)  is  a  constant  independent  of  k. 
Proof:  .  Let  A,B  be  augmented  by  zeros  as 


~  fA  °\ 

A  =  ,  kiX  ky, 

\o  o) 


k i  X  ky. 


Then  yp{A)  =  yp{A),  yp{B)  =  yp{B),  and  yp{AUBU')  =  yp{AUBU)  where  U  {ky  X 
ky)  is  obtained  by  adding  independent  standard  normal  variables  to  U.  Therefore 


(3.62) 


j^yp(A)yr(B)  =  £vyr(AUBV) 

=  e0yMUBu’) 

- 


Hence  \kp/yp{Ik)  =  Xfcip/l/p(lfcl).  fi 

We  evaluate  the  bp  s  in  Section  4.2.  Corresponding  to  Theorem  3.3,  Theorem  3.4 
can  be  generalized  as  follows. 
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Theorem  3.6.  Let  X  be  a  k  X  k  random  matrix  (not  symmetric)  such  that  for  every 
orthogonal  l"i,  A,  the  distribution  of  A-X"A  is  the  same  as  the  distribution  of  X.  Then 
for  symmetric  A,B 

(3-63)  £xyp(AXBX)  =  lpyp{A)yp{B), 

where 

P-64)  %  =  £x{yr{xx'))i{yp{ik)}\ 

Proof:  For  any  orthogonal 

(3.65)  exyP{AXBX)  =  £xyp(AxriBrv  • x y). 

Letting  Pi  be  uniformly  distributed  we  obtain 

exy^AJCBx1)  = 

P-66)  U,m 

-  mtxMAxT)- 

Now  V=  JTX'  has  an  orthogonally  invariant  distribution  because  AVA '  =  (A-X)(A-Xy. 
Therefore  by  Theorem  3.3 

(3.67)  £xyp(AxX)  =  yp{A)£x  {yp{xX])i  yP[ik)- 

Substituting  (3.67)  into  (3.66)  we  obtain  the  lemma.  | 

Remark  3.10.  We  call  the  distribution  of  X  “orthogonally  biinvariant”  if  it  satisfies' 
the  condition  of  Theorem  3.6. 

Corresponding  to  Lemma  3.8  we  have 

Lemma  3.9.  Let  X  —  H\DH%  where  H\,H2  are  orthogonal  and  D  is  diagonal.  Let 
Hi,  H2,  D,  be  independently  distributed  such  that  Hi, H2  have  the  uniform  distribution. 
(D  can  have  any  distribution.)  Then  X  has  an  orthogonally  biinvariant  distribution. 
Conversely  all  orthogonally  biinvariant  distributions  can  be  obtained  in  this  way. 

The  proof  is  entirely  analogous  to  the  proof  of  Lemma  3.8,  therefore  we  omit  it. 
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Remark  3.11.  The  notion  of  orthogonal  biinvariance  can  be  applied  to  rectangular 
matrices.  If  X  is  k  X  m  in  Theorem  3.6  we  obtain 


(3.68) 


sxy,  (xy) 

'  yr(h)yr{i„) 


and  in  Lemma  3.9  (for  k  <  m)  we  replace  X  —  H\DHi  by  X  —  H\(D,  0)Jf2 . 

In  the  sequel  we  almost  exclusively  work  with  the  Wishart  and  the  normal  dis¬ 
tributions.  But  in  view  of  Theorem  3.3  and  Theorem  3.6  there  could  be  other  distributions 
which  give  information  on  various  aspects  of  zonal  polynomials. 


§3.3  An  integral  representation  of  zonal  polynomials 

We  prove  an  integral  representation  by  Kates(1980)  which  shows  that  (i)  zonal 
polynomials  are  positive  for  positive  definite  A  and  increasing  in  each  root  of  A,  (ii)  in 
the  normalization  Zp  defined  below  the  coefficients  apq  in  Zp  —  J2apqMq  are  nonnegative 
integers.  The  derivation  by  Kates  is  rather  abstract  but  the  integral  representation  can  be 
proved  directly  in  our  framework.  The  representation  can  be  formulated  in  several  ways. 
James(1973)  derived  one  involving  uniform  orthogonal  matrix.  We  discuss  these  variations 
in  Section  4.7. 

From  Theorem  3.5  we  see  that  a  constant  bp  or  equivalently  the  value  of  a  zonal 
polynomial  at  I k  describes  a  particular  normalization.  The  normalization  Zp  is  the  simplest 
one  in  this  sense. 

Definition  3.3.  A  particular  normalization  of  a  zonal  polynomial  denoted  by  Zp  is 
defined  by 

(3-69)  Zp(Ik )  =  Xfcp, 

or  bp  =1  in  Theorem  3.5. 

Theorem  3.7.  (Kates,  1980)  Let  p  =  (pi, . . .  ,Pe).  For  k  X  k  symmetric  A 


(3.70) 


ZP(A)  = 


3.3.  An  integral  representation  of  zonal  polynomials 
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where  A*  =  UAV{1, . . .  ,i)  is  the  determinant  of  the  upper  left  minor  of  UAU*  and  U  is 
a  k  X  k  random  matrix  whose  entries  are  independent  standard  normal  variables. 

Proof :  Let 

■ 

(3.71)  ~Zp{A)  =  £u{  Af-p*-..AH. 

It  can  be  routinely  checked  that  (3.71)  is  a  homogeneous  symmetric  polynomial  of  degree 
n  =  |p|  in  the  roots  of  A.  Furthermore  augmenting  A  to  A  (ki  X  hi)  by  adding  zeros  and 
augmenting  U  to  U  by  adding  independent  standard  normal  variables  does  not  change  the 
upper  left  part  of  UALJ1 .  Therefore  (3.71)  does  not  depend  on  k.  Hence  ZP(A)  6  Vn(A). 
Now  we  want  to  show 

(3-72)  ^{Z^A)}  =  KPZP(A). 

Let 


(A  0\ 

1  :  v  X  v 

\°  0) 


and  W  —  YY  where  Y"is  a  v  X  v  matrix  whose  entries  are  standard  normal  variables. 
Then 

t„{Zp{A)}  =  £^{ZP(AW)} 


(3.73) 


=  fy £„{ n  [UYAYU'(1, . . . , l-)]Pi-p.+x } 

»=i 

t 

=  Su£Y{J[[YUAUlY(l,...,i)}’’<-’’‘+'}. 


»=i 


We  switched  U  and  Y  because  they  have  the  same  distribution.  Now  by  Lemma  3.7  Y— 
TH  and  H  can  be  absorbed  into  U.  Therefore 


(3-74) 


Su  &  ( II  (YUAU'Y(  1, . . . ,  i)]Pi"Pi+1 } 

»=i 

t 

*= i 
t 

=  So  £t{  n  (*?i  ■  •  •  4)Pi-Pi+‘  [UAU'(1, . . . ,  i)]"  } 


»=i 

t 


=  ^pSo{  TllUAU'(l,...,i)}"-P^} 

»=i 

=  Xl/pZp(A). 
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Hence  ZP(A)  is  a  zonal  polynomial  by  Theorem  3.1.  Putting  A=  Ik  we  obtain 
(3.75)  Zp(Ifc)  =  fw{W(l)Pl-p2--*W(l,...,i!r}, 

where  W —  'W(Iky^)-  Again  by  the  triangular  decomposition  W  =  IT'  (Lemma  3.3)  we 
obtain  Zp{Ik )  =  X kp •  Therefore  ZP(A)  =  ZP{A).  | 


Note  that  the  coefficients  of  the  monomial  terms  in  Zp  are  integers,  being  the 
expected  value  of  sum  of  products  of  independent  standard  normal  variables.  Furthermore 
if  A  =  diag(aj, . . . ,  ak)  then  by  the  Binet-Cauchy  theorem  (see  Gantmacher(1959)  for 
example) 


UAL 7,(l,...,r) 

(3.76) 


£  £ 

<1  <-<»,  Jl<  —  <Jr  V 

ii  <•••«,  V  \ 


jrj  V  •  •  •  >  r  J 


where  jB(*1,",’tr)  denotes  the  determinant  of  a  minor  formed  by  rows  i'i, . . . ,  ir  and  columns 
j\, ,  jr  of  B.  (3.76)  is  obviously  increasing  in  each  a,-  when  A  is  positive  definite. 
Furthermore  coefficients  for  monomial  terms  are  nonnegative.  These  points  are  discussed 
in  Kates(l980).  For  more  about  this  see  Section  4.1.  Generalizations  of  Theorem  3.7  will 
be  discussed  in  Section  4.7. 


§3.4  A  generating  function  of  zonal  polynomials 

One  of  the  main  contributions  of  Saw(1977)  is  his  generating  function  which  gives 
a  relatively  simple  way  of  computing  zonal  polynomials.  Clearly  (tr  C)n  £  Vn{C).  Therefore 
we  can  write 


(tr  O)”  =  E  ‘'pZptCO- 

p€Pn 


(3.77) 
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Let  C  =  AUBXf  where  A  =  diag(ai, . . . ,  a^),  B  =  diag(/?i, . . .  ,/3k)  and  the  elements  of 
U  are  independent  standard  normal  variables.  Then  by  Theorem  3.4 

£u{tr  AUBU>)n 

=  dpSuZpiAUBU’) 
p€Pn 

(3'78>  =  E  dpjjrjZp(A.)Zp(B) 

P€f. 

=  E  dpZp(^)Zp(B). 

pePn 


Therefore  for  sufficiently  small  0 


(3.79) 


£t/{exp(0  tr  AUBU*)} 

oo 

=  £u{^{0n  ln^{tx  AUBV)n} 

n~0 

oo 

=  Ef9"/"!)  E  dpZp{A)Zr(B). 

n— 0  p6  Pn 


On  the  other  hand 

k 

(3.80)  tr  AUBlf  = 

i,i 


Hence  for  sufficiently  small  6 


(3.81) 


k 

£v  {vx.p{0  tr  AUBTJ1)}  =  £u{exp(0  a,^)} 

i,3 

k 

=  11(1-2 

*p  i 


From  (3.79)  and  (3.81)  we  obtain 


Theorem  3.8. 


k 


OO 


nil  -  =  E  (*7»'>  E  d pZp(A)Zp(B). 

i,j  n=0  p€Pn 


(3.82) 
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The  left  hand  side  of  (3.82)  can  be  expanded  as  follows. 
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JJ(1  -  2 Octipj)-* 


k 

=  exp{log  ^1  -  2 
*'»j 


(3.83)  =  exp{i  ~r-tr{A)tr{B)} 

r—1 

oo 

=  E  E 

n=0 


oo 

=  E  (*’/»!)  E  o,Tp(A)Tp(B), 

«  — 0  p€Pn 

where 


f*(p) 

(3.84)  cp  =  |P|!2lH-MP)m  rPr~Pr+ .(Pr  _  Pr+1)! 

The  fourth  equality  follows  from  the  fact  that  Tj,  being  a  product  of  pi  terms  comes  only 
from  the  p\ -th  power  term  in  the  expansion  of  exp.  Comparing  the  coefficient  of  9n  in 
(3.82)  and  (3.83)  we  obtain 


(3.85)  X;  dpZp(A)Zp(B)  =  X  cpTp(A)Tp(B). 

pGPn  P€P" 

Note  that  cp  is  positive  for  every  p  £  Pn-  therefore  putting  A  =  B  and  regarding  (3.85) 
as  a  quadratic  form  in  Vn(A .)  we  see  that  (3.85)  is  positive  definite.  It  follows  that  dp  >  0 
for  every  p  £  Pn-  Now  let  D  =  diag {dp,p  £  P„),  C  =  diag(cp,p  £  Pn).  We  recall  that 
Z  =  BU  where  B  is  upper  triangular  and  T  =  FU  where  jFis  lower  triangular  (see(2.37)). 
Therefore  in  matrix  notation  (3.85)  is  written  as 


(3.86) 


U{A)'B'D3U{B )  =  U{A)'FCFU{B), 
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or 

(3.87)  S'DB  =  FCF. 

We  note  that  the  left  hand  side  and  the  right  hand  side  correspond  to  two  different  trian¬ 
gular  decompositions  of  the  same  symmetric  positive  definite  matrix.  F  can  be  computed 
from  (2.33)  or  alternatively  F  can  be  obtained  from  tables  given  in  David, Kendall, and 
Barton(1966)  for  n  <  12.  Therefore  we  can  compute  the  right  hand  side  of  (3.85)  rela¬ 
tively  easily,  then  we  decompose  the  resulting  positive  definite  matrix  as  B'DB.  Diagonal 
elements  of  S  corresponding  to  Zp  is  obtained  in  (4.33).  This  determines  D  and  B  uniquely. 

Remark  3.12.  Saw(l977)  defined  zonal  polynomials  or  the  coefficient  matrix  B  by 

(3.87)  and  derived  the  first  part  of  Theorem  3.1  from  this  definition.  It  seems  that  (3.87) 
should  be  looked  at  as  providing  a  convenient  algorithm  for  obtaining  B  rather  than  provid¬ 
ing  a  definition  of  S  because  it  lacks  the  conceptual  motivation  necessary  for  a  definition. 

Actually  dp  is  known  to  be  (James(1964),formula(18)) 

dP  =  X[2P]0)2nn!/(2n)! 

(3.88)  _  2wn!  rL<y(2p,  ~  2 Pj  -  i  +  j) 

n‘S(2w  +  Up)  -  01  ’ 

where  n  =  \p\  and  x[2p](l)  =  (2n)!  -  2p,  -  i  +  j )  /  nfilpK  +  Up)  -  »')!  is 

“the  dimension  of  the  representation  (2p)  =  (2pi, . . . ,  2p^p()  of  the  symmetric  group  on 
2 n  symbols.”  This  is  one  thing  we  were  unable  to  obtain  by  our  elementary  approach.  It 
was  obtained  by  James(196i)  using  group  representation  theory.  We  will  discuss  this  point 
again  in  Section  4.2  and  Section  5.4.  1 


dpZp  is  usually  denoted  by  Cp  so  that 
(3.89)  (tr  A)n  =  ^  CP(A). 

pePn 

This  notation  often  makes  it  simpler  to  write  down  various  noncentral  densities.  Our  last 
item  in  Chapter  3  is  related  to  this  point. 


Lemma  3.10. 


e&Aaf-l 


(3.90) 


3.  Derivation  and  some  basic  properties  of  zonal  polynomials 


38 


where  k  X  k  orthogonal  H  is  uniformly  distributed. 


Proof :  Let  the  singular  value  decomposition  be  A  —  ABA  where  A>  A  are  orthogonal, 
D  —  diag(£i,  ...,6k)  and  0^  =  6?,  i  —  1, . . . ,  k  are  the  characteristic  roots  of  AA! .  Then 
(tr  AH)2n  =  (trABA-H)2"  =  (tr  BABA)  2"  and  A#A  has  the  same  distribution 
as  H.  Therefore  £#(tr  AH)2n  is  a  2  n-th  degree  homogeneous  polynomial  in  Si, ...  ,6k. 
Furthermore  the  order  of  6i,...,6k  and  the  sign  for  each  6,  are  arbitrary  in  the  singular 
value  decomposition.  It  follows  that  £#(tr  AH)2n  £  Vn{AA!).  Therefore  we  can  write 

(3.91)  £H{tr  AH)2n  =  £  apZp(AA'). 

pePn 

Now  let  A  =  diag(ai, . . . ,  a*)  and  JJ  —  («,_,)  be  as  before.  Then  tr  AU  —  Ylaiu a  is 
distributed  according  to  M(0,  52  ai  )•  Hence 


(3.92) 


£u[trAU)2n  =  (£  a,?)n  •  1  •  3-  •  ,-(2»  -  1) 


=  ^j(tr  AA') 
2nn!  ^  ’ 


l\n 


=  E 


PGPn 


(2  n)\dp 
2nn! 


ZP{AA!). 


On  the  other  hand  by  Lemma  3.7 

£u{trAU)2n  =  £T>H(ir  ATH)2n 

=  ^  ap£TZp{ATT'A!) 
(3.93)  pGPn 

=  E  «pWp(AA'). 

pe^r. 

Comparing  (3.92)  and  (3.93)  we  obtain  (3.90).  | 


Chapter  \ 

More  properties  of  zonal  polynomials 


This  chapter  is  a  collection  of  results  which  are  for  the  most  part  generalizations 
and  refinements  of  the  basic  results  given  in  Chapter  3.  We  do  not  attempt  to  collect 
various  known  identities  involving  zonal  polynomials.  For  this  purpose  the  reader  is  referred 
to  an  excellent  survey  paper  by  Subrahmaniam(1976).  Actually  in  the  discussion  of  the 
orthogonally  invariant  distributions  we  saw  that  zonal  polynomials  satisfy  an  infinite  num¬ 
ber  of  identities.  It  is  a  rather  frustrating  fact  that  although  many  identities  for  zonal 
polynomials  are  already  known,  explicit  forms  of  zonal  polynomials  are  not  known. 


§4.1  Majorization  ordering 

The  proof  of  Theorem  3.2  which  played  an  essential  role  for  the  subsequent 
development  in  Chapter  3  is  not  complete  as  it  is.  In  (3.49)  we  argued  that 

(4-4)  0  =  X>-0P  -  Koq)cqq>yq(s)yg/(B), 

9.9' 

for  any  B,B  implies  (XVop  —  X„o9)cgg/  =  0.  One  objection  may  be  that  27  is  restricted 
to  be  positive  semidefinite.  But  this  causes  no  trouble  since  (4.1)  is  a  polynomial  and  a 
polynomial  which  is  identically  zero  for  nonnegative  arguments  has  to  be  zero  everywhere. 
A  more  serious  question  is  that  the  dimensionality  of  27  and  B  is  fixed  to  be  k  X  k  which 
is  the  dimensionality  of  the  unilorm  orthogonal  matrix  H.  The  same  objection  applies  to 
the  proof  of  Lemma  3.10.  (In  the  case  of  U  whose  entries  are  independent  standard  normal 
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variables  we  could  augment  U  :  k  X  k  to  U  :  X  h  without  loss  of  generality  and  we 

could  avoid  this  difficulty.)  What  we  have  to  consider  is  •  •  • ,  2fc),the  space  of  n-th 

degree  homogeneous  symmetric  polynomials  in  x\,...,Xk  ( k  fixed)  and  we  have  to  show 
that  {yp,p  E  Pn  }  is  indeed  linearly  independent  and  forms  a  basis  for  Vnik{%i,  •  •  • » ®fc)- 
To  do  this  we  now  study  homogeneous  symmetric  polynomials  again  from  the  viewpoint  of 
majorization  ordering.  The  following  is  a  refinement  of  Lemma  2.5. 

Lemma  4.1. 

(4.2)  UP  =  MP  +  Y  apqMq, 

9-<Pi9^P 

(4.3)  MP  =  UP+  Y  aPqU<r 

q<p,<i^p 

Proof:  Let  a  =  X\  —  •  •  •  =  xr.  Then  the  degree  of  a  in  M,  is  qy  +  •  •  •  +  qr.  The  degree 
of  a  in  Up  is  pi  +  •  •  •  +  pr  because 

Up{ot,  .  .  .  ,  OL,  Xr+1>  «  •  •  ) 

=  (a  +  •  •  •  )pl~P2(a2  +  •  •  •  )P2~P3-  •  • 

•  (ar  H - )Pr- Pr  +  l  (arXr+1.  .  .  )Pr+l-Pr  +  Z  .  .  .  (a'Xr+l  •  •  -Xt  H - )P* 

'  ‘  '  —  co,(pi-P2)+2(pz-P3)+  —  +r{pr-Pr+i)+—+rpt  .  .  . 

=  caPl^  *~Pr  +  •••. 

where  c  is  a  term  not  containing  a.  Let  Qr  =  {q  |  q  <  p  and  qi  +  •  •  •  +  qr  >  Pi  + 

•  •  •  +  pr  }•  Now  since  the  degree  of  a  in  (4.2)  is  pi  4 - 4-  pr  we  have 

(4.5)  ^  ]  Opg  Atg((3!j  •  •  .  ,  Q!;  Xr+1>  •  •  •  )  0* 

q€.Qr 

Now  .M,(a, . . . ,  a,  xr+i, . . . ,  x*;)  are  linearly  independent  if  k  is  sufficiently  large.  Therefore 
apq  —  0  for  q  £  Qr.  Repeating  this  argument  for  r  =  1,  2, . . .  we  have 

(4.6)  opg  =  0  if  q  E  Q1UQ2U •••• 

But  if  q  is  not  majorized  by  p  there  exists  an  r  such  that  q  E  Qr-  Therefore  ap,  =  0  for 
every  q  which  is  not  majorized  by  p.  This  proves  (4.2).  (4.3)  can  be  proved  similarly.  | 
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Let  Vnik(xi,  —  ,xk)  be  the  vector  space  of  n-th  degree  homogeneous  symmetric 
polynomials  in  xit  xk  ( k  fixed). 

Lemma  4.2.  {MP,p€  Pn,£{p)  <  k},{Up,p  E  Pn,l{p)  <  k}  are  bases  ofVnik(x lf  ..., 

Xk). 

Proof :  Note  that  Mp(a:x, . . . ,  xk)  =  0,  Up(x1} . . . ,  xk)  =  0  for  p  such  that  l{p)  >  jfc.  Let 
/  £  Vn,fc(*i>  •  •  • ,  £*)•  Then  from  (2.11) 

f(xi,...,xk)  ^  ]  aPAtp(a;i> . . . ,  xk) 

(4.7)  p€Pn 

y .  aP-MP(a;i>**->*fc)- 

P6^n,^(P)<At 

Therefore  any  /  6  V„lfc(®i, . . . ,  xk)  can  be  written  as  a  linear  combination  of  Mp’s  for  which 
P  €  Pn,£[p)  <  A:.  Now  suppose 


(4-8)  a9Mi(  xt,...,xk)  =  0. 

gePn,i(9)<  k 

Then  differentiating  (4.8)  with  respect  to  x{  p<  times,  *  =  l,...,^(p),  (note  i(p)  <  k)  and 
setting  0  =  *!=••  •  =xkwe  obtain  ap  =  0.  Therefore  {  Mp,  p  E  PnAv)  <k}\s  linearly 
independent  in  Vn>k(xi, . . . ,  xk).  This  shows  that  {Mp,p  E  Pn,  4p)  <  k)  is  a  basis  of 
Vn,k(x  1, ... . ,  xk).  To  show  that  {  Up,  p  €  Pn,  £{p)  <  k  }  is  a  basis  it  suffices  to  observe 

Alp(xx, . . . ,  a;*.)  =  £/p(zi,  — ,  Efc)  +  ^  apqUg{xi, ,  xk) 

(4.9)  ?<p 

—  Up(x  1,  ...,Xk)+  22  aPqUq{Xl>  •  •  •  ,  a^fc). 

9  <P, *(?)<* 

This  and  (4.7)  with  /  replaced  by  shows  that  {Up,p  E  ,Pn,^(p)  <  £}  is  another  basis  of 

•  •  • ,  Xk).  | 


Remark  4.1.  It  is  known  that  apq  in  (4-2)  is  nonzero  and  positive  if  and  only  if  q  p. 
This  is  called  the  Gale-Ryser  theorem.  (See  Macdonald(1979),  Marshall  and  Olkin(1979).) 

Now  we  prove  the  following. 
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Theorem  4.1. 

(4-10)  ]/p  =  ^  apqUq  =  a>Pq^1 

q<p  q~ip 

and  {yp,p€  Pn,t(p)  <  k}  forms  a  basis  ofVnik  (*i, . . . , xjt). 

Proof:  We  first  note  that  majorization  is  transitive, i.e.  if  p1  >-  p2,p2  >-  p3  then  p1  >-  p3. 
Therefore  in  view  of  Lemma  4.1  the  equalities  involving  U’s  and  M’s  are  equivalent.  Hence 
we  prove  one  involving  U’s.  Now  as  in  the  proof  of  (4.2)  the  right  hand  side  of  (3.16) 

(4.H)  e 

*1  t'l<*2 

has  only  those  monomial  terms  M?(A)  for  which  q  p.  Therefore 


(4.12) 


ru[U.p)  —  53  &pg-Mg* 

.  q<p 


Substituting  (4.3)  into  (4.12)  and  using  the  transitivity  of  majorization 


WC  U  U  1/o.Hl 


(4.13) 


Now  let 


r^(Zip)  —  5^  b'pqUq. 
q-<p 


(4-14)  2/p  =  53  flpA- 

q<p 

We  want  to  show  that  Qp  —  {q  |  apq  ■=£  0,  q  not  majorized  by  p}  is  empty.  We  argue  by 
contradiction.  Suppose  that  Qp  is  nonempty.  Let  q*  be  the  highest  partition  in  Qp.  Then 
apg  0  and  q  >  q*  imply  q  -<  p.  For  any  such  q 

(4.15)  apqTv{Uq)  =  apq{  53  ^gg'^g'}* 

q’<q 

Now  q'  q  ,  q  -<  p  imply  q'  -<  p.  Hence  the  right  hand  side  does  not  have  Uq *  term.  It 
follows  that  Uq •  does  not  appear  in 

53  apqTv{Uq)' 

9*<?<P 


(4.16) 
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Obviously 


(4-17)  J2  aPiT^<i) 

9  <  9* 

does  not  involve  Uq»  term  either.  Therefore  the  coefficient  of  Uq-  in 


(4.18) 


Ti '(i/p)  —  ^2  aP9r‘'{^q) 

9  <P 

—  apq‘Ti>[Uq')  +  (4.16)  +  (4.17) 
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is  apq*\uq..  On  the  other  hand 


(4-19)  r„(yp)  =  \upyp. 

Therefore  the  coefficient  of  Uq-  on  the  right  hand  side  of  (4.19)  is  \upapq..  Taking  v  =  v0 
we  have  a  contradiction.  Therefore  Qp  is  empty.  This  proves  (4.10). 

To  prove  the  second  assertion  we  note  that  q  -<  p  implies  £(q)  >  i(p).  Otherwise 
Pi  +  •  •  •  +  pt(q)  <  pi  +  •  •  •  +  p^(p)  =  n  =  qi  +  •  •  •  +  qe^  and  this  contradicts  q  -<  p. 
Therefore  in  (4.10)  we  have  only  those  Uq's  for  which  l(q)  >  l(p).  Now  suppose  that  A  is 
k  X  k  and  k  <  l{p).  Then  every  Uq(A)  in  (4.10)  vanishes.  Hence 

(4*^9)  i/p(-^)  =  0  if  A  is  k  X  k  and  k  <  £{p). 

Now  write 


(4.21) 


Then 


(4.22) 


Up  =  E  a"!/,. 

9<P 

Up{xi}  ...,xk)=  ^2  apqyq{xi ”,Xk) 

9<P 

-  E 

9<pA  9)<fc 


Similarly 

(4-23^  yp(X  !»•••»**)  =  Y2  apqUq[xif->Xk)- 

9<P,^(9)<fc 

In  view  of  Lemma  4.2,  (4.22)  and  (4.23)  imply  that  {  j/p,  p  6  Pn,  /(p)  <  k  }  forms  a  basis  of 

Vn,k[x  l,...,Xk).  | 
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In  the  proofs  of  Theorem  3.2  and  Lemma  3.10  we  replace  all  the  summations  by 


(4.24) 


E 

p€PnAp)<k 


E 

*(»)<*.*(«')<* 


etc. 


Then  those  proofs  are  complete.  We  do  not  repeat  the  steps  of  those  proofs.  But  in  later 
proofs  we  will  be  careful. 


Remark  4.2.  Using  the  Gale-Ryser  theorem  (Remark  4-1)  and  (3.70)  it  can  be  shown 
that  a'pq  in  (4-10)  is  positive  if  and  only  if  q  -<  p.  This  is  stronger  than  Theorem  4-1- 

For  future  references  we  record  (4.20)  as  a  corollary. 

Corollary  4.1.  If  A  is  k  X  k  and  I(p)  >  k  then  yp[A)  =  0. 


§4.2  Evaluation  of  i yp[Ik) 

In  the  sequel  we  often  work  with  a  normalization  denoted  by  i yp  which  has  the 
leading  coefficient  1,  namely 

(4.25)  ,yp  =  up  +  £  iapqUq. 

9<P 

Advantages  of  this  normalization  will  become  clear  soon.  We  shall  evaluate  i  l/p(2fc).  From 
Thorem  3.5  we  know  that  j bp  =  Xfcp/  i yp[Ik)  is  a  constant  independent  of  k.  Therefore 
our  goal  is  to  obtain  i bp.  Now 

(4-26)  ibplyp(Ik)  =  \kp  =  Zp(Ik). 

Therefore  \bp  is  the  leading  coefficient  of  Zp.  This  was  needed  for  the  unique  decomposition 
of  the  left  hand  side  of  (3.87).  We  use  the  following  recursive  relation. 

Theorem  4.2.  If  A  is  a  k  X  k  symmetric  matrix,  then 

(427)  |A|,yp(A)=  ,yp+(i.)(A)> 

where  p  +  (1*}  =  (f!  +  l,p2  +  1, . .. ,  Vk  +  l,Pk+u  ■■■)€  P„+k,  n  =  |p|. 

Proof:  If  t{p)  >  k  then  i yp(A)  =  0  by  Corollary  4.1.  In  this  case  I(p  +  (lfc))  ==  £(p) 
>  k.  Hence  i3/p+(i*)(A)  =  0.  (4.27)  holds  trivially  in  this  case.  Now  let  l{p)  <  k.  Let 


4.2.  Evaluation  of  i^/p(.Zjfc) 


45 


[au...,ak)  =  \(A).  Clearly  |A|  i]/p(A)  £  Vn+kik(au  •  •  • , <**).  Therefore  by  Theorem  4!1 
we  can  write 


(4.28) 


|A|  ,yp{A)  =  J2  i yg(A). 

?6  Pn+kAv)<k 


Putting  AW  in  (4.28)  and  taking  expectation  with  respect  to  W(Jfc,  u)  we  obtain 

(4.29)  ev,{\AW\lyr{Am=  E 

qePn+k,e(q)<k 

Now  the  left  hand  side  of  (4.29)  is  equal  to  \A\  £w{  ,  Z/p(AW)  \W\}.  Absorbing  \W\  into 
the  Wishart  density  and  letting  W”be  distributed  according  to  1V(Ik,i/  +  2)  we  obtain 

\A\£W{  t]/P(AW)|Wl> 


=  H  ff 


r[^  +  3-0] 


*(p) 


(4.30) 


—  i  ii«*  n  riW'' + 3  _  *)i  «  tt  riw + 4(" + 3  -  oi  ,, .  . 

_«»+«TT  ’’[W  +  I +  !(•'+ 1-01,  ., 

~  M  +  i-o)  w  MA> 

~  ^*/,p+(i*)|A|  i]/p (A) 

=  x^,p+(ifc)  5Z  °?i  ^(^-). 

9G  A+fc|f(?)<fc 


Subtracting  (4.30)  from  (4.29)  we  obtain 

(4.31)  23  ag(Xi/fl  —  X^  p^^fcj)  i l/q{A)  —  0. 

qePn+k,  t(q)<k 

This  holds  for  any  kXk  symmetric  A.  Therefore  by  Theorem  4.1  a,  (X^,  -  X„  p+(lfc))  = 
0  for  every  q  such  that  q  £  Pn+k,Z[q)  <  k.  Taking  v=v0  we  obtain  aq  =  0  for  q  ^  p-f  (1*=); 
Now  comparing  the  leading  coefficient  in  (4.28)  we  see  ap+(lfc)  =  1.  This  completes  the 
proof.  | 


Corollary  4.2.  (Formula(129)  in  Jame,(1964))  Let  p  =  . pt)  and  p  -  (pi) 

=  (p  1  —  Pti  P2  —  pt,  ....  pi—  i  —  pi).  Then  for  an  ty.  t  symmetric  A 

<4-32)  i  yM)  =  W"  ,i /r-wiA). 


Proof:  \A\pt  ,3fp_(rS =  W‘-‘  .3/r-W)+„.,(A)  =  -  =  ,1/P(A).  | 
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Applying  Corollary  4.2  to  the  identity  matrices  of  appropriate  dimensionalities  we 
can  evaluate  \bp  in  (4.26). 

Theorem  4.3. 


(4.33) 


*(p)  »  i  i 

~{j  - 1)  +  Pi  -  Pi)Pi 

-Pi+ 1> 

.=1,  =  1 


where  (a)t  =  a(o  +  !)•••  (a  +  k  —  1). 


Proof:  We  prove  this  by  induction  on  the  length  of  p.  Let  l{p)  —  1,  namely  p  =  (pi). 
Then 


(4.34) 


MW  =  MW  =  1"‘  =  1- 


Therefore 


(4.35) 


ibp  =  \lphyp(h) 

—  1  •  3'  •  •  (2pi  —  1) 


which  is  of  the  form  (4.33).  Now  suppose  that  (4.33)  is  true  for  t[p)  =  k  —  1.  We  want 
to  show  that  then  (4.33)  holds  for  t(p)  =  k.  Let  p=(pi , . . . ,  p*)  and  p  —  (p£)=(pi  —  pt, 
P2  —  Pk,  . .  • ,  Pfc-i  —  Pk)-  Note  that  £(p  —  (pj;))  =  k  —  1.  Putting  I*  in  (4.27)  we  obtain 


(4.36) 

or 

(4.37) 


1 2/p(A)  =  i3/p-(rt)(*) 


i  °p 


^ kp 


(p£)\ 


k,p-(Ph) 


Using  the  induction  hypothesis 
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(4.38) 


* = ^  8  nd--  -  - » + <« - «)  -  <*  -  - 

= 2,-cf  i  n  n  d>-  -  - 1) + w  -  *)w_m, 


»= i  j—i 


2l1"  n*-i  rfe  +  i(k  + 1  -  j)]/r[j(*  + 1  -  J)] 

2if-(»J)i  n‘=‘,  r[p,  -  p,  +  J(t  + 1  -  i)l/r[ J(*  + 1  -  yjj 

= 2bi  n  n  (r  - ■ - ») + » - ».  w.  ■  n  «*  *  -  \{j  - 1)  +  Pi  -  Pk)pk 
k  i 


-*wnn<i*-io--i>+»-KW 


*= i  y=i 

Therefore  (4.33)  holds  for  k  =  ^(p)  and  the  theorem  is  proved.  | 

There  is  a  curious  fact  about  tbp.  Let  A;!!  denote  1  •  3*  •  •  k  or  2  •  4-  •  •  k  depending 
on  whether  k  is  odd  or  even.  Then  as  above  it  can  be  shown  by  induction  that 


(4.39) 


A  -  n  nW  - .• + *> -  DU. 


Now  (tr  A)n  =  EdPZp(A)  =  Y,dplbp  ,]/p(A).  From  (3.88) 


*(p) 


~vr/ 

dp  ibp  —  2 "n!  (2 —  2 p3-  —  i  +  j)/  JJ  (2p,  —  i  +  £(p))l 


*</ 


»=i 


(4.40) 


n|i5^^nW-i+«-i)!. 

- 2"nl  n  ^ -  - i'in/8^ ~i  +  em- 


This  is  very  similar  to  i6p  1  if  we  ignore  the  constant  2nnl.  In  Section  5.3  we  will  see 
that  in  the  complex  case  the  corresponding  quantities  dp,  x~bp  satisfy  an  exact  relation 
dp{  i bp)2  =  nl. 


§4.3  More  on  integral  identities 

In  this  section  we  evaluate  the  constant  cp  in  Theorem  3.3  for  several  distributions. 
The  first  one  is  the  inverted  Wishart  distribution.  See  Khatri(l966),  Constantine(1963). 
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Lemma  4.3.  Let  W  be  distributed  according  to  u),u  >  2 h(p)  4-  k  —  1.  Then 

(4.41)  £wyp(AW~1)  =  cpyp(A), 


where 


(4.42) 


_ tt  r[£(y -*  +  «)- wl 

P  M  r[*(y  -  fc  +  0]2w  • 


Proof:  Let  A  =  diag(aj, . . . ,  a*)  without  loss  of  generality.  We  look  at  the  monomial 
term  aPl  •  •  •  api  ( t  —  t(p)).  Then  as  in  (3.18)  its  coefficient  in  (4.41)  is 

(4.43)  £w{W^1(l)Pl-p2  W^l,  2)Pa“P3-  •  ■  W^(  1, . . . ,  £)p*}, 

which  has  to  be  equal  to  cp.  Let  W=  T'T  where  Tis  lower  triangular  with  positive  diagonal 
elements.  Then  analogous  to  Lemma  3.3  tu,i  =  1, . . . ,  k,  are  independently  distributed 
according  to  x(t/  —  k  +  i).  Then  and  T-1  is  lower  triangular  with  diagonal 

elements  reciprocal  to  the  diagonal  elements  of  T.  Therefore  W_1(l, . . . ,  r)=(fn*  •  •  trr)~2. 
Hence 

cP  =  £{tT?Pl--t7?P1} 

^  A  -  * + o  - 

.11  r[fr-k  +  t))2» 

i 

■  —  {  JJ  (^  —  k  +  i  —  2pi)[u  —  k  +  i  —  2pi  +  2)*  •  •  [u  —  k  +  i  —  2)}“l. 

»=l 


Lemma  4.3  implies  the  following  interesting  identity  which  is  briefly  mentioned  in 
Constantine(1966).  Let  p*  t  be  defined  by  (2.5). 

Lemma  4.4.  Let  A  be  an  t  X  t  positive  definite  matrix.  Then 


(4.44) 

where  s  >  h{p),t  >  i(p). 


MT 


yM-1) 

yP{it) 


y?:M) 

yr:.,w 


Proof:  Without  loss  of  generality  let  A=diag(ai, . . . ,  at).  Consider  |A|S  l/p(A  1).  Let 
(4.45)  yp(A-1)=^2apqMq(l/a1,...,l/at). 

9<P 


4.3.  More  on  integral  identities 


49 


Note  that  q  <  p  implies  h{q )  <  h(p).  Now  the  degree  of  1  /a,-  in  >[,(1/0;!,. . .  ,l/at)  is  /i(g). 
Hence  the  degree  of  l/a{  in  yp(A~x)  is  h(p).  Now  |A|3=  (ar--at)a  and  s  >  h[p).  We  see 
that  1/ai  is  cancelled  by  |A|3  and  |A|3  yp[A~x)  is  a  polynomial  in  (01, . . . ,  at).  Clearly  it  is 
symmetric  and  homogeneous  of  degree  st-\p\.  Therefore  |A|3  t/p(A_1)  6  Kt-lpl.tfai,  • . . , 
at).  By  Theorem  4.1  we  can  write 


(4.46) 


iAi3j/P(A-1)  =  y  h*yM. 


Replacing  A  by  AW  in  (4.46)  and  taking  expectation  with  respect  to  W(It,  t>)  we  obtain 
(4.47)  Sw{\AW\ayp{A-xW-x)}=  Y  f>qKqyq(A). 

q&P,t-\P\Aq)<t 

Now  proceeding  as  in  (4.30)  and  using  Lemma  4.3  the  left  hand  side  of  (4.47)  can  be 
evaluated  as 

\A\s£w{\w\°yp(A-lnr1)} 

—  1  IT  + 1  + 2s - *)]  ,  ,-i» 

~|A|2  n  ■  T[i(^ + i  -  oj  £w{yM  w  )} 

^(p)  11  r  1  / 


(4.48) 


=  |A|32S*  IT  rlHt/  +  1  +  2s  -  i)\  r[^(i/  +  28  -  t  + j)-py] 

Bi  rl^+.l-*)]  B  r[i(t,  +  2s-*  +  i)]2P;  ^p(A  } 

=  |A|3  V  (A~1l23t-|p|  IT  +  1  +  2g  -  t)| 

1  iyA  M  r[*(p  +  i-01 

1~j~  +  1  -  j')  +  a  -  pt_ j/+1) 


j'= 1 


HK*'  +  1  +  2s  —  J7)) 


=  IAI3V  fA-M23t-lpl  IT  r[Mt/  +  1-  0  +  a-Pt-»‘+i] 

1  l]/p(  ]  U  r[i(*  +  1-0) 

where  W~  W(It,  u  +  2s)  and  /  =  t  -  j  +  1.  Now  p*s>t={s  -  pt,s  -  pt_x,  . . .  ,s  -  Pi)  and 
(4.49)  X„p.  =  23t~lpl  f[  +  !-*)  +  *-  Pt-i+i] 

Bi  r[i(^  +  i  -0] 

Therefore  combining  all  these  things 

X 


(4.50) 


"'Pi.*  X/  b*yq{A)  —  ^.p.*,t  1) 

q€P.t-\p\,t(q)<t 

=  Sw{\AW\ayp{A-xW-x)} 
~  bq\l/qyq(A). 

qeP.t-\p\Ai)<t 
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It  follows  that  bq= 0  if  q  p*  t.  Therefore 

(4.51)  =  tK.,3 /,;..(A). 

Putting  A  =  It  we  obtain 

(4.52)  bp'  4  =  3/P(J*)/3/p.t(I*). 

I 

The  second  distribution  is  a  “multivariate  F”  distribution.  There  are  many  ways 
to  generalize  the  univariate  F  distribution  to  the  multivariate  case.  Here  we  work  with  the 
following  version.  For  other  generalizations  see  Johnson  and  Kotz(l972). 


Lemma  4.5.  Let  the  columns  of  X\  :  k  X  vu  X2  :  k  X  i>2  {"2  >2 h[p)  +  k  -  1)  be 
independently  distributed  according  to  M[0,E).  Let  ]W~X\ (.X2 X2 )  •  Then 


(4.53) 


l(p) 

ewiVMW))  =  ^ P II 


»=i 


r[|(t>2  —  k  +  i)  —  Pi] 

r[±(i/ 2  -k  +  i)]2p» 


yP[A). 


Proof:  Premultiplying  Xi,X2  by  2?  ^  we  can  take  E — Ik  without  loss  of  generality. 
Then 


(4.54) 


£wyp(AW)  =  eXlexMx1Ax'1(x92Ct)-i) 

TT  r  [\{V2-k  +  i )  -Pi]  r  ^  T  X 

“  M  m*  -  k + oj2« x*p]/p( 


Remark  4.3.  It  is  more  or  les3  obvious  to  prove  Lemma  4-5  for  other  definitions  of 
multivariate  F  distribution. 

Our  last  distribution  is  multivariate  beta  distribution  (Constantine(1963)).  The 
following  derivation  is  essentially  the  same  as  in  Constantine(1963),  but  a  little  bit  more 
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“probabilistic”.  Let  Wi,W2  be  independently  distributed  according  to  1V(£,  v2) 

{E:kX  k)  respectively.  Note  that  W=WX  +  W2  ~  W{U,ut  +  v2).  Now  the  conditional 
density  of  W\  given  W  is 

f{Wi  |  W) 

_  c\wi\tl~^~L  cxp(-^  tr  S-'W^IW-  Wi  exp(-l  tr  ZT^W-  Wt)) 

(4.55)  |W|  '+  2  exp(—  i  tr  E~lW) 


\W\ 


+j*  2  —  1 


where 

(4.56) 


c  = 


nf-»r[lfo+ «*-»  + 1)1 


Tr^fc-D/4  nL,  r[iK  -  i  +  l)]r[i(ix2  -  i  + 1)]  ’ 

Note  that  terms  involving  £  cancel  out  in  (4.55)  .  Therefore  the  conditional  distribution 
does  not  depend  on  £.  When  W=J,  /(Wi  |  I)  is  called  multivariate  beta  density: 


(4.57) 


f(W1  |  I)  =  c|W1|!^|I-  w1|i^ 


Since  this  density  is  orthogonally  invariant  the  conditional  distribution  of  W\  given  W=  I 
is  orthogonally  invariant.  Now  we  want  to  evaluate  cp  in 

(4-58)  €  {yp{AWx )  |  W=I}  =  cpyp[A). 

For  a  positive  definite  A  let  A2  =rDir’  where  T  is  orthogonal  and  D  is  diagonal  in 
A  .  Now  the  conditional  distribution  of  A2  WXA.2  given  I'V  =  I  is  the  same  as  the 

conditional  distribution  of  Wx  given  W  —  A.  This  follows  from  the  above  mentioned  fact 
that  the  conditional  distribution  does  not  depend  on  £.  Therefore 

(4-59)  e{yp{Awx)  |  w=  1}  =  e{yp(wx)  I  a}. 

Letting  A  =  Wx  +  W2  we  obtain  from  (4.58)  and  (4.59) 

(4-6°)  e{yP{w. 1)  |  wx  +  w2}  =  cpyp{wx  +  w2). 

Now  taking  unconditional  expectation  we  obtain 

(4.61) 

^ipVv  (^)  —  cp^vi +v2,PyP  (£). 


Hence  cp  —  X„lp/X„1+l,2iP.  Now  we  have  proved 


4.  More  properties  of  zonal  polynomials 


52 


Lemma  4.6.  Let  Wi  have  the  density  (4-57).  Then 
(4.62)  £wMAWl)  =  -h^E-y^A), 

At'l  +  V2  ,P 

Variations  of  the  above  three  lemmas  can  be  found  in  Khatri(1966),  Subrahmaniam(1976). 


§4.4  Coefficients  of  Uq  in  t/p 

In  this  section  we  study  coefficients  of  Uq's  when  zonal  polynomials  are  expressed 
as  linear  combinations  of  Uq  s.  For  definiteness  we  work  with  ioP9  in  iyp=Up+'$2  i apqUq. 
If  rank  A=l,2  all  the  relevant  coefficients  are  known  and  we  can  compute  l/p(A)  ex¬ 
plicitly.  We  review  this  first.  After  that  we  study  several  recurrence  relations  between 
the  coefficients.  When  rank  A  >  2  these  recurrence  relations  are  not  enough  to  compute 
the  values  of  zonal  polynomials  yp{A)  for  all  p.  Nonetheless  they  seem  to  be  very  useful. 
Coefficients  of  Atg’s  will  be  discussed  in  the  next  section  and  7^’s  in  Section  4.6.  We  discuss 
relative  advantages  of  various  bases  on  the  way. 


4.4.1  Rank  1  and  rank  2  cases 

If  A  is  symmetric  and  rank  A=1  then  A  has  only  one  nonzero  root.  Let  A  = 
diag(o:i,0, . . .  ,0)  without  loss  of  generality.  By  Corollary  4.1  yp(A)  =  0  if  £{p)  >  2. 
Therefore  only  onepart  partitions  p  =  (pi)  count.  Obviously 

(4-63)  1y{pi){A)  =  U{pi)(A)  =  a?. 

Therefore  in  this  case  zonal  polynomials  reduce  to  powers  of  aj. 

Now  suppose  rank  A=2.  Let  A=diag(ai,  a?,  0, . . . ,  0).  We  have  to  consider  only 
partitions  with  two  parts  p  =  [p\,P2)-  Now  we  use  Corollary  4.2: 

(4-64)  1 2/(Pl,Pa)(A)  =  |A|P2  l l/(Pl-P2)(A), 


where  (pi  —  p%)  is  a  onepart  partition.  Therefore  it  suffices  to  know  the  value  of  a  zonal 
polynomial  of  onepart  partition  evaluated  at  rank  2  matrix  A.  Actually  zonal  polynomials 
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of  onepart  partitions  are  known  explicitly.  If  we  let  =  l)yS2  =  •••  =  /Jfc= 0  in  (3.82) 
we  obtain 


(4.65) 


II  (1  -  2 *<«,■)-*  =  X)  («"/»!Mh4,W  !*(.)• 


1=1 


n=0 


Now  in  (3.77)  (tr  C)n  —  U(n)(C)  and  &(„)(C)  term  appears  only  in  Z(n)(C).  Therefore 
(tr  C)n=  i ]/(n)(C)  +  •  •  •  =  (1/  i&(„))  ^(^(C)  +  •  •  • .  Hence  d(n)=  We  have 


fid -»*)-»- 


(4.66) 

t=  1  n=0 

The  left  hand  side  can  be  expanded  as  follows. 
k 

XI  (1  —  2 Octi)-i  —  {1  -  (2 0ut  -  402u2  +  •  •  •  )}~* 


t=i 


(4.67) 


Ytior  T  Pl  \ 

,=o  peX  pl!22  2  VPi  —  P2> P2  —  p3j •  •  •  > Pt(p)J 


n=0  p6  A 

.  ^_lj(P2~P3)+(P4—  Ps)+—  ^  . 


This  follows  from  the  fact  that  Up  being  a  product  of  pi  terms  comes  only  from  pi-th  power 
term  in  the  expansion  of  (1  -  20m  +  •  •  •  )-1/2.  Comparing  (4.66)  and  (4.67)  we  obtain 

Z(n)  =  2*n!  (_i)(P2-P3)+(P4-Ps)+-.  Wpi 


(4.68) 


pePn 


(Pi  -P2)b--P£(p)! 


uv. 


Note  that  i6(n)  =  1 -3- •  •  (2n- 1),  2”n!=2-4-  •  •  (2n),  \A\^U{quq2)  (A)=W(,1+p,in+pi)  (A). 
Therefore  if  rank  A=2  we  obtain  from  (4.64) 


(4.69) 


JAl  —  2  '  4~  —  2?>2) 

i^(Pi,p2)l  J  1.3...(2pi-2p2-l) 

(I) 


J5  ‘  -  «)!«,!ahl+»*+"»(A)- 


See  formula  (130)  in  James(1964). 

If  rank  A=3  what  we  have  to  know  is  the  values  of  zonal  polynomials  of  twopart 
partitions  evaluated  at  a  rank  3  matrix  A.  Obviously  things  become  more  and  more 
complicated  as  rank  A  increases.  However  several  useful  recurrence  relations  on  the 
coefficients  can  be  obtained. 
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4.4.2  Recurrence  relations  on  the  coefficients 

We  present  here  three  recurrence  relations.  The  first  one  is  already  used  in  deriving 


(4.69). 

i 

Lemma  4.7. 

If  k  >  i{p),  k  >  i(q),  then 

(4.70) 

lflP9  =  lap+(lfc),g+(lfc)* 

Proof:  Let  A  be 

k  X  k.  Then 

(4.71) 

\A\  iJ/p(A)  =  |A|{HP(A)  +  Y  i aPgUA)} 

q<p,£(q)<k 

—  ^p+(l*)(A)  +  Y,  lapg^g  +  (l*)(A). 

g<pAg)<k 

By  Theorem  4.2 

(4.72) 

|A|  i yp{A)  =  1]/p+(tfc)(A) 

=  i/p+(1fc)(A)  +  Y  iap+(.P),g'Ug'(A)' 

g'<p+(P)Ag')<k 

Comparing  (4.71)  and  (4.72)  we  obtain  by  Lemma  4.2  iaP9  =  iap+(i*jig+(i*).  1 

Remark  4.4. 

Theorem  4-%  has  been  known  and  Lemma  4- 7  is  almost  an  immediate 

consequence.  However  it  does  not  seem  to  have  been  explicitly  stated. 

The  next  one  is  in  a  sense  conjugate  to  Lemma  4.7.  Let  p  —  (pi, . . . ,  pi)  €  Pn  and 
m  >  pi=h(p).  We  denote  by  (m, p)  the  partition  [m,p\,p2,  ••  -,Pt)  €  P n+m* 

Theorem  4.4.  Let  m  >  h[p).  Then 
dm 

(4.73)  - . —  l  J/(m,p)(ai>  •  •  •  > «fc+ 1)  =  ™\  ij/p(ai, . .  • ,  «*). 

"“fe+i 

Proof :  Let 

{m,p){al>  -  -  -  >  ak+l)  —  'y  1  a(m,p),g^g(al>  •  •  •  >  afc+l) 

g<(™,p),g€Pn+7n 

Y  a(m,p),(m, g')U(m,g’){al>  •  •  •  >  afc+l) 

(m,g  ')<(rn,p),g'eP„ 

+  Y  a{m,p),qUg{ai,...,ak+l). 

g  ePn+  m  ,h(g)<m 


(4.74) 
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We  differentiate  (4.74)  m  times  with  respect  to  ak+l.  Now  the  degree  of  afc+1  in 


(4-75)  Uq(au...,ak+l)  =  (^“»)9l_92(Xla*ai)92_93-” 

i<3 

is  qi  =  (gx  -  q2)  +  [q-2  -  q3)  +  •••  +  qe{g).  Therefore  the  terms  in  the  second  summation 
on  the  right  hand  side  of  (4.74)  drop  out.  Now  &(m, ,/)(«!,  ...,  ak+1)  is  a  product  of 
m=(m-q'l)+  •  •  •  +q'i(q>)  elementary  symmetric  functions  ur{ a1>v  .  ,ak+1)  which  are  linear 
in  afc+1.  Therefore  differentiating  U(miql)  m  times  we  are  left  with  the  term  where  each 
ur  is  differentiated  exactly  once.  Furthermore 

(4-76)  d^~^u^ai>--->ak+^  =  Ur~  i(at>  •••>“*)• 

Therefore  by  the  chain  rule  of  differentiation 


OOLk+l 


,/)(«!,...,  ftfc+l) 


(4.77) 


=  m!{^7"l(ai’  •  •  • » ak+l)}m~9^d~^u^>  •  •  • . 

—  ajfc)9l_,,2«2(ai, . . . ,  ak)q*~g*- •  • 

=  m!Uq>(a  x,...,  ajt). 


Let  J/(a i,...,ak)  —  (^m/^ajfc+i)J/(m,p)(«i>  •  •  • , «Jt+i)-  Then 

~  ^  Qm 

]/(«l,  •  •  •  ,  &k)  =  2_^  a(m,p),(m,q)T^l  ^(m,9)(“l,  •  •  •  ,  Ofc+l) 

(4.78)  9=Sp  fc+1 

m •  a(‘m,p),(m,q)Uq(ai,  . . . ,  ak). 

9<P 


We  replaced  q'  by  q  and  [m,  q)  <  (m,  p)  by  q  <  p  since  (m,  q)  <  (m,  p )  if  and  only  if 
Q  —  P-  We  want  to  prove  that  y  is  a  zonal  polynomial.  This  can  be  done  by  showing  that 
y  satisfies  the  condition  of  Theorem  3.1. 


Let  A  diag(aj, . . . ,  a^+j)  and  A\  —  diag(«i, . . . ,  ak).  Then  exactly  as  above 
we  obtain 
(4.79) 

gm  k 

=  ™!  £  <W>, <•».,) w[k  +  l)“-« (£)  W(*'i .  k  + 1 .  • 

k 

‘(  y  ^  Q:t'i  •  •  •  att(,)  W(ii, . . . ,  ii(q),  k  +  1)) 

ii  <  •"<*<(,) 
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Let  Wbe  partitioned  as 


(4.80) 


u>k+ 1 


where  Wk+i,k+i  —  W(A:  +  1)  is  a  scalar.  Let 

(4.81)  Wn-k+i  =  Wn  —  wk+iw'k+l/wk+i,k+i- 


Then  by  the  well  known  identity  on  the  determinant  of  partitioned  matrices  we  have 


(4.82)  W{ii,...,ir,k  +  1)  = 


Therefore  in  (4.79)  u>5c+i  *+1  =  wfc+i  fc+i  ^  comes  out  as  a  common  factor  and  we 

obtain 


(4.83) 


a«r+1 


y (m,p){-A-W *)  m.!  ^  a(m,p),(m,g)'u,fc+l,fc+l  ^11-fc+l) 

9<P 

=  «r+i,*+i^(AiWii.M.i) 


Now  if  W  is  distributed  according  to  ‘W(Ik+i,i')  then  Wk+i,k+i  and  Wii-t+i  are  inde¬ 
pendently  distributed  according  to  x2{v),  W(Tfc>i/  —  1)  respectively.  (See  Srivastava  and 
Khatri(1979), Theorem  3.3.5  or  Mardia,  Kent,  and  Bibby(1979),  Theorem  3.4.6.)  Therefore 
taking  expectation  with  respect  to  W 

dm 

(m,p)y  i-A-l)  —  (m,p)(al)  •  ••  J  afc+l) 

dm 

=  -i/(m,)(AW)} 

(4.84)  °  k+i 

=  £w{wT+i,k+iy{Aiwi-i-k+i)} 

Hence 

(4.85) 

Now 


ewy{AiWu.k+t) = (x„,(m,p) j 2"~r  r (|] 


,  „„+nr(?  +  ">)rf  r(iy  +  w) 

r(|)  U  r(*p)  ' 


(4.86) 
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Therefore 


(4.87) 


r(f +  m) 
r(f) 


=  x 


By  Theorem  3.1  we  conclude 


(4-88)  y  =  cyp. 

Comparing  the  leading  coefficient  we  obtain  c  =  m\  in  (4.73).  | 


Corollary  4.3.  If  m  >  h(p)  then 

(4.89)  i  apq  ia(*»,p),(m>5r)* 


Proof:  From  (4.73)  and  (4.77) 

m!(^P  +  i apqUq) 

i<p 

dm 

(4-9°)  =  m!  1  Vp  =  oTm  1 V  (m,p) 

°ak+ 1 

m\(Up  +  53  l°(»n,p),(m,<j)£/9)> 

9<P 

Therefore  (4.89)  holds.  | 

In  terms  of  the  diagram  of  p  Lemma  4.7  corresponds  to  adding  a  column  to  the 
left  of  the  diagram  and  Corollary  4.3  corresponds  to  adding  a  row  to  the  top.  In  this  sense 
they  are  conjugate”.  There  might  be  a  deeper  reason  for  this  conjugacy. 

x  •  •  •  •  X  X  X  X  X 

X  •  •  .... 

X  •  •  .  . 

X  •  .  . 

X 


Figure  4.1. 
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Our  third  recurrence  relation  follows  from  Lemma  4.4. 
Lemma  4.8.  If  s  >  h(p),s  >  h(q),t  >  ^(p),  t  >  ^(g),  f/ien 
(4.91)  iaP9  =  lflp; 


Proof :  From  Lemma  4.4 


(4.92) 


|A|M/p(A-1)  =  c1]/p;t(A), 


or 


(4.93) 


|A|*Up(A->)  +  2  i<UA|*ii,(A-‘) 

9<P.*(>?)<‘ 

=  c(i/p.(A)+  ldp-,q'Ugi  (A)), 

9'<PV(9')<* 


where  p*  =  p*t  and  c  =  i]/p(it)/  it/p*(It).  Now  let  A=diag(ai,  ...,  at).  Then 

|A|izr(A_1)  =  (ar.-  at)  2Z  - - - ~ 


(4.94) 


^  ^  ‘  ’  ’  ajt-r 

=  ttt-r(A). 


Note  that  (4.94)  is  true  for  r  =  0,  t  if  we  define  uo  =  1.  Therefore 

|A|SZ/,(A"1)  =  |A|3-^{|A|^«1(A-1)^-^...^(,)(A-1)^)} 

(495)  =  «t(A)3-^«t_1(A)’>-^...«t_,(,)(A)^) 

=  Ug:M), 

because  q*t  —  (s, . . . ,  s,  s  —  qt(q), . . . ,  s  —  q%,  s  —  qi)  and  £(g*t)  —  t.  Substituting  (4.95)  into 
(4.93)  we  obtain 


(4.96) 


lfp.(A)  +  21  iaP<A:.t(A) 

9<p,<(q)<t 

=  c{^p.(A)+  2Z  i«PvMA)}. 

q'  <p*  ,l{q')<t 


Therefore  by  Lemma  4.2  iaP9  =  iop*  9;  ,  c=l.  fl 
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Remark  4.5.  Again  this  lemma  is  much  easier  to  grasp  in  terms  of  the  diagram.  See 
Figure  2.2. 

Looking  at  Table  2  in  Parkhurst  and  James(1974)  we  find  that  the  above  three 
recurrence  relations  give  a  lot  of  coefficients  without  any  calculation  (except  that  the  table 
is  for  Zp  rather  than  for  i  t/p).  However  it  seems  to  be  still  a  long  way  to  obtain  all 
coefficients  along  this  line. 


§4.5  Conversion  to  coefficients  of  Mq  and  James’  partial  differential  equation 

So  far  we  have  been  mainly  working  with  Up  s.  But  in  view  of  Lemma  2.5,  Lemma 
4.1  etc.  we  could  have  worked  with  Mp’s  as  well.  We  defined  zonal  polynomials  in  con¬ 
nection  with  the  Wishart  distribution  and  it  was  more  straightforward  to  define  zonal 
polynomials  in  terms  of  Up  s  in  that  setting.  But  when  it  comes  to  obtaining  coefficients 
it  seems  easier  to  work  with  Afp’s.  In  this  section  we  translate  every  result  in  Section 
4.4  into  the  coefficients  of  Mp’s.  Another  big  advantage  of  working  with  monomial  sym¬ 
metric  functions  is  a  partial  differential  equation  by  James(1968),  from  which  he  derived  a 
recurrence  relation  on  the  coefficients  of  monomial  symmetric  functions  in  a  zonal  polyno¬ 
mial.  (Note  that  the  recurrence  relations  of  Section  4.4.2  were  on  the  coefficients  of  Uq’s  in 
different  zonal  polynomials.  Here  the  recurrence  relation  is  on  the  coefficients  in  one  zonal 
polynomial.)  Actually  it  is  possible  to  develop  a  whole  theory  of  zonal  polynomials  from 
the  partial  differential  equation.  This  is  done  in  a  recent  book  by  Muirhead(1982)  explicitly 
and  illustratively.  We  discuss  the  partial  differential  equation  and  the  recurrence  relation 
in  Section  4.5.4. 

Furthermore  Jacob  Towber  (personal  communication)  has  recently  obtained  a 
combinatorial  method  for  determining  the  coefficients.  His  method  involves  several  steps  of 
counting  related  to  the  diagram  of  a  partition.  At  the  moment  the  combinatorics  involved 
seems  to  be  too  complicated  to  obtain  an  explicit  formula  for  the  coefficients,  but  it  might 
be  carried  out. 

From  the  above  discussion  we  see  that  we  have  much  more  information  on  the 
coefficients  of  ,MP  s  than  on  the  coefficients  of  Z/p’s.  Therefore  in  a  sense  it  is  pointless 
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to  work  with  Up's  any  more.  However  from  a  computational  point  of  view  it  is  easier  to 
compute  Ug’s  once  we  obtain  the  characteristic  roots  and  the  characteristic  equation  of  a 
matrix  A.  We  simply  multiply  the  elementary  symmetric  functions.  In  the  case  of  Atp’s  we 
have  to  multiply  the  roots  in  all  possible  ways  and  sum  them  up.  The  relative  advantages 
of  Mp’s  and  Up's  should  be  judged  from  this  viewpoint  too. 


4.5.1  Rank  1  and  rank  2  cases 

Let  p  —  (n)  be  a  oncpart  partition.  To  express  Z(n )  in  monomial  symmetric 
functions  we  can  use  the  integral  representation  by  Kates.  This  was  done  by  Kates(1980). 
Letting  r  =  1  in  (3.76)  we  obtain 


k 

(4.97)  UAV(1)  = 

i=i 

where  A  =  diag(«i, . . . ,  a*)  and  uu,  i  —  are  independent  standard  normal 

variables.  Therefore  by  (3.70) 

k 

(4.98)  Z(n)  =  £(£,  «.<)”• 

»=i 


Now  the  coefficient  of  a?1  •  •  •  a*1  on  the  right  hand  side  is 


(4.99) 


Therefore 


(4.100) 


ZW  =  »!2-  £  M„(A)  n  (H 

pepn  .= i'p'J 


This  looks  nicer  than  (4.68).  i]/p  has  the  leading  coefficient  1,  so 

tip) 

Mp 

pePn 


(4.101) 


. -C-T's-nO 
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Now  let  A —  diag(o:1,  a2)-  Then  for  q={qi,q2)  [q\  q2) 

|A|fcMg(A)  =  (ai  a2)k[al1al2  +alla{2) 

(4.102)  =*r+k<x?+k+*q2i+k*r+k 

"  M(qi+k,q2+k){A)' 


(The  equality  of  the  extreme  left  and  the  extreme  right  hand  sides  holds  for  qi=q2  too). 
Therefore  from  (4.64)  we  obtain 


(4.103) 

t3/(pi,P2)(«i»as)  =  (2pi  2p2>) 
V  Pi  —  P2  J 


-1 


E 

(^1  — P2 


CTCT 


■^(9i+P2.92+P2)(®1>  ^2)" 


This  takes  care  of  rank  1  and  rank  2  cases. 


4.5.2  Again  on  the  generating  function  of  zonal  polynomials 

lo  express  Tp  in  terms  of  .Mg’s  we  can  simply  expand  it  and  count  various 
monomial  terms.  Therefore  it  seems  easier  to  express  the  right  hand  side  of  (3.85)  in  M,’s 
than  in  Uq’s.  Then  we  decompose  the  resulting  positive  definite  matrix  as  LL'  where  L 
is  lower  triangular  with  positive  diagonal  elements.  The  elements  of  L  give  the  desired 
coefficients.  The  development  on  page  34  goes  through  in  exactly  the  same  way  except  that 
we  order  {Tp>p  G  Pn  }  according  to  the  lexicographic  ordering  of  the  conjugate  partition 
p'  (see  Remark  2.3).  We  do  not  repeat  it  here. 

Rather  we  notice  here  the  similarity  between  two  generating  functions  (3.82)  and. 
(4.66).  Let  71,  ...,  7*2  denote  the  k2  numbers  atf,-,  i  =  1  j  =  1  Let 

C=diag(7i,  ...,  7^2).  Then  from  (3.82)  and  (4.66)  we  have 

00  00 

(4.104)  E  (*"/»!)  E  dpZp[A)Zp{B )  =  £  (*” /n!)Zw(C). 

n=0  PeP„  n=0 

Hence 

E  dpZp(A)Zp(B)  =  Z(n)(C). 

pePn 


(4.105) 
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Now  we  can  substitute  (4.100)  into  the  right  hand  side.  Then  it  reduces  to  expressing 
■Mp(C)  as  a  sum  of  products  M9(j4) This  seems  nicer  than  directly  expanding  the 
right  hand  side  of  (3.85). 

Finally  we  prove  that  the  coefficient  of  in  Zp  ,  p  €  Pk  is  k\.  This  is  stated 

in  James(1968). 

Lemma  4.9.  Let  p  £  Pk  and  A=  diag(ai, . . . ,  a*).  Then 

dk 


<4'106»  5,,**...  Bat 

Hence  the  coefficient  of  in  Zp  is  k\. 


ZP(A)  =  k\. 


Proof: 

(4.107) 


11(1  -»«<&)-*  =  Et"”/"')  E  ipZM)Zr(B). 

P&Pu 


i,j  n=0 

Differentiating  this  by  <*1,0:2,. . . ,  a*  we  obtain 
k  f  k  nn  \  k 


fllSHSist)®--.)-' 


(4.108) 


n=0 


pEPn 


Now  0fij/(l  —  2 6a.if3j)  =  0f3j  +  higher  order  in  0.  Hence 

(4.109)  |J  T-  2Qa  ffi^  =  °k ^ 53  PA*  +  hiVher  order  in  °- 

Comparing  the  coefficients  of  0k  we  obtain 

k 


(4.110) 

But  by  (3.77) 

(4.111) 


‘Ei«‘=E^-m^t(4 

3=1  pePk 


3=i  pePk 


Comparing  (4.110)  and  (4.111)  we  obtain 

dk 


dav  --dak 


ZP(A)  =  k ! 


I 
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4.5.3  Recurrence  relations  of  Section  4.4.2 

Here  we  work  again  with  the  normalization  jl/p.  Let 
(4.112)  i3/,  =  Mp+X  iM,- 

9  <P 

Lemma  4.10.  If  k  >  l{p),k  >  l(q),  then 

(4-113)  lip,  =  lip+(lfc)i<H-(l*:). 

Proof:  Let  A 

(4.114) 

Note  that  this  equality  does  not  hold  for  augmented  monomial  symmetric  functions.  The 
equality  above  holds  because  the  summation  is  over  distinguishable  terms  and  a 
is  distinguishable  from  a?'  ■  •  •  a q-Lt  if  and  only  if  (a^-  •  •  £*&)£*’*•  •  •  a?*  is  distinguishable  from 
(«i  -  •  •  afc)ay,‘  •  •  •  <*£•  For  augmented  monomial  functions  refer  to  (2.15).  Now  the  lemma  can 
be  proved  just  as  Lemma  4.7  if  we  replace  Up,Uq,  i apq  in  (4.71)  by  Mp,Mq,  i bpq  respectively. 
I 

Lemma  4.11.  Let  h{p)  <  m.  Then 

(4.115)  1  bpq  lb{m,P),{m,q)’ 

Proof:  The  degree  of  afc+1  in  M,(ai, . . . ,  ock+l)  is  h(q).  Hence  if  h(q)  <  m,  then 

am 

(4-116)  gam+i  ■  •  • ,  ak+i)  =  0. 

If  h{q)—m  let  q=(m,  q').  Then  clearly 

dm 

(4-117)  om  -Mq(«i,--.,«fc+1)  =  m!M,'(ai, . . . ,  a/c). 

oak+ 1 

(Again  this  equality  does  not  hold  for  AMq.)  Now  (4.90)  holds  with  i bpq  replacing 

Up,Uq>  i apq  respectively.  This  proves  the  lemma.  | 


—  diag(a1,...,ajlc)  where  k  >  l(p).  Then 


|A|At,(A)  =  (a,...afc)  £ 

(*"i  >•••»*’<)  C(l,...|fc) 

-  £ 

(*t . *t)C(l,....A:) 

=  -M9+(.ifc)(A). 


af 1  a1-2  •  •  •  a?1 
*1  *2 


9.  +  1  92  +  1. 

tl  t2 
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Lemma4.12.  If  h(p)  <  s,h(q)  <  s,  £(p)  <  t,  t[q)  <  t,  then 
(4-118)  i  bpq  =  lbPl,  t,?.v 


Proof:  Let  A  =  diag(aq, . . . ,  at),  q  =  (qi, . . . ,  qt),qi  <  s,  l  <  t.  Then 

\ — '  1 


|A|s.Mg(A_1)  =  (ai---at)s 


a?.1- 


(4.119) 


S— 9l/v3- 92.  •  • 
*1  »2 


O', 


91 

it 


7!  ’*a;.  ’*•••<* 


a 


ji 


■  a 


Jt-* 


where  1  <  jq, . . .  ,jt-t  <  t  are  indices  not  included  in  (»i, . . . ,  i*)  and  g*t=(s, . . . ,  s,  s  —  qi, 
. . . ,  s  —  q2,  s  —  qi).  (Again  (4.119)  dos  not  hold  for  AMq).  Now  (4.93), (4. 95), (4. 96)  hold  with 
ibpq  replacing  Up,Uq,  \apq  respectively.  This  proves  the  lemma.  | 


We  have  shown  that  the  recurrence  relations  of  Section  4.4.2  hold  in  exactly  the 
same  way  for  the  coefficients  of  Uq’s  as  for  the  coefficients  of  Mg's. 

In  the  next  section  we  discuss  James’  partial  differential  equation  and  a  recurrence 
relation  derived  from  it.  The  mathematical  development  will  be  somewhat  sketchy. 


4.5.4  James’  partial  differental  equation  and  recurrence  relation 


James(1968)  derived  a  partial  differential  equation  satisfied  by  a  zonal  polyno¬ 
mial  from  the  fact  that  a  zonal  polynomial  is  an  “eigenfunction  of  the  Laplace-Beltrami 
operator.”  Let  A=diag(c*i,  p—[p i,  •••,  Pe)  6  JVThen  his  partial  differential 

equation  is 


<4-12°)  E  +  E  =  (E  w(*  - «' + *  -  0W4 

*=1  *  ’  3  ’  *=1 


This  might  seem  a  little  bit  strange  because  it  depends  on  the  number  of  variables  ( k 
appears  in  the  summation  on  the  right  hand  side).  Let 


4.5.  Conversion  to  coefficients  of  Mq  and  James’  partial  differentia!  equation 
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Then  the  right  hand  side  of  (4.120)  can  be  written  as 


(4*122)  °i(p)]/p(A)  +  n{k  -  1  )yp{A),  n  =  |p|. 


To  get  rid  of  n(k  -  1)J/P(A)  we  notice  the  fact  that  for  any  /  £  Vn(A),  at(d/dat) 

f=nf.  To  prove  this  it  suffices  to  show  that  £  (S/dai)Mp(A)=nMp(A)  for  any  p  £  Pn. 
Now  this  follows  from 


(4.123) 

Therefore 


E 


a 


1  dai 


.  npi _ 

ai 


—  (Pi  H - +  Pe)a !*• 


a 


pi _ 


na 


pi  . 


■  a 


pi 


<4-124>  (*  -  1 M,(A)  =  <*  -  1)  E  J(P(A). 

i=i  * 

But  we  can  write 


(4.125) 


( k  1]  S  =  22  ai~yp(A) 

*=1  3  =  1  ' 


Now  subtracting  (4.125)  from  both  sides  of  (4.120)  and  using  the  relation  a?/(a<  -  ay) 
— at=atQ!J/(al'  —  ay)  we  obain 


(4.126) 


E  + E  = «,wi/p(a), 


i=i 


which  does  not  involve  A:  as  a  coefficient  and  is  valid  for  any  number  of  variables.  (4.126) 
was  derived  by  Sugiura(1973)  in  an  elementary  way.  Because  his  exposition  is  clear  and 
readable  (except  that  there  are  complications  like  a  higher  order  partial  differential  equation 
and  differential  equations  for  complex  zonal  polynomials)  we  do  not  derive  it  here.  Let 

(4-127)  yP(A)  =  bpqMq{A). 

9<P 

Substituting  this  into  (4.126)  we  obtain 


Ev, 

9<P 


=  °i(p)  2  bp^uiA)- 

?<p 


(4.128) 
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Now 

(4.129) 


E  E 

.=1 


a,ay  d 
di  —  ay  da,- 


W,(A) 


can  be  expressed  as  sums  of  monomial  symmetric  functions.  Then  comparing  both  sides 
of  (4.128)  we  can  determine  the  coefficients  bpq.  It  is  hard  to  visualize  what  is  going  on 
here  unless  one  works  out  some  examples.  Muirhead(1981)  does  that  very  carefully  using 
(4.120)  rather  than  (4.126).  See  James(1968)  also.  Therefore  we  only  sketch  the  procedure 
here. 


Let  q  =  (qi, . . . ,  qe).  It  is  fairly  straightforward  to  verify  that 


(4.130) 


d 2  1 

53  a'  Qofi  ^9  =  53  ~ 


t=  1 


(4.131) 


da,-  * 


aiaj 

t^.a,--ay 


i 

—  ^  qi(i  —  1)A19  +  lower  order  terms. 

»=i 


Adding  (4.130)  and  (4.131)  we  obtain 


(4.132) 


DMq  =  oi(<z).M9  +  lower  order  terms, 


where 


d 2 


0=  +  \ ^  _OCidi _ d_ 

1  dai  <*i  -  <X]  dai 


It  is  fortunate  to  get  only  lower  order  terms  by  the  differential  operation.  It  is  this  triangular 
nature  of  the  differential  operation  that  enables  one  to  determine  bpq  recursively  starting 
from  the  (arbitrary)  leading  coefficient  bpp.  If  one  works  out  “lower  order  terms”  (which  is 
not  hard)  one  arrives  at  the  following  rule  by  James(1968): 


(4.133) 


P<7 


=  E 

q<q'<p 


((ft  +  r)  -  (gy  -  r))b, 
oi(p)-ai(g) 


where  q'  is  an  unorderd  partition  of  the  form  q’=[qi, . . . ,  g,  +  r, . . . ,  gy  —  r, . . . ,  qt{q)),  (1  < 
r  <  qj).  The  summation  is  over  all  ( i,j,r )  where  i  <  j,  r  >  1  such  that  when  the 
unordered  partition  q'  is  ordered  we  have  q  <  q'  <  p. 
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4.6.  Coefficients  of  Tq  in  Zp 

Actually  in  view  of  Theorem  4.1  we  have  to  consider  only  partitions  q,  q'  which 
are  majorized  by  p. 

T-  he  advantage  of  this  method  is  that  it  is  self-contained.  It  gives  all  coefficients  of 
a  single  zonal  polynomial  without  computing  others.  Therefore  it  is  by  far  the  best  method 
if  one  is  interested  in  computing  few  zonal  polynomials.  On  the  other  hand  if  one  wants  to 
compute  many  zonal  polynomials  then  relying  exclusively  on  this  method  seems  to  involve 
a  great  deal  of  redundant  computations  in  view  of  recurrence  relations  of  Section  4.5.3. 

Remark  4.6.  Logically  the  recurrence  relation  (4.133)  is  not  complete  until  one  shows 
that  the  denominator  ai(p)~  a^q)  is  never  zero.  James(1968)  states  that  (4.133)  gives  rise 
to  positive  bpq’s.  Since  the  numerator  ( ft+r)-  (9j- -r)=(gt- -  9y)+  2 r  is  positive  he  seems 
to  claim  that  ax(p)  —  ai(q)  >  0  for  all  relevant  pairs  p,q.  By  Theorem  4.1  it  is  enough  to 
prove 

(4.134)  ai(p)  —  ai{q)  >  0  forp>~q. 

Then  this  ensures  that  (4.133)  works  for  all  cases  and  nonzero  bpq’s  are  positive.  (4-134) 
can  be  easily  proved  using  techniques  from  the  theory  of  majorization.  See  Marshall  and 
Olkin(1979).  We  do  not  go  into  this  here. 

§4.6  Coefficients  of  Tq  in  Zp 

In  this  section  we  study  the  coefficients  of  Tp.  The  normalization  Zp  seems  to  be 
most  advantageous.  An  important  fact  about  the  coefficients  of  Tp  is  their  orthogonality. 
We  derive  this  first.  See  formulas  (117)and  (118)  in  James(1964),and  Problem  13.3.9  in' 
Farrell(1976). 

For  p  6  Pn  let 

(4-135)  Zp=  9PqTq. 

Let  Z=  (Z(n),  . . . ,  Z^in^Y,G=(gpq).  Then  (4.135)  can  be  expressed  in  a  matrix 

form  as 


(4.136) 


Z=GT. 
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Now  we  recall  that  the  transition  matrix  Fin  T—Fll  is  lower  triangular  ((2.37)).  Substi¬ 
tuting  this  into  (4.136)  we  obtain 

(4.137)  Z  =  GFU. 

But  Z  =  BU.  Hence 

(4.138)  G  =  BF~l 

Now  (3.87)  shows 

(4.139)  S'DS  =  F'CF, 

where  C  =  diag(cp,p  £  Pn)  is  obtained  in  (3.84)  and  jD=diag(dp,  p  £  Pn)  is  known  as 
(3.88).  From  (4.138)  and  (4.139)  we  obtain 

(4.140)  G'DG  =  C. 

Inverting  this 

(4.141)  GC-'G'  =  D~l. 

Coordinatewise 


(4.142) 

dpdpqQpq'  —  fiqq'Cq, 

p 

( column  orthogonality), 

(4.143) 

9pq9p'q/cq  ~  ^pp'/dp, 

9 

( row  orthogonality), 

where  <5pp/ 

is  Kronecker’s  delta. 

Actually  cq,q  £  Pn  coincide  with  the  elements  of  the  first  row  of  G.  To  see  this 
let  Pi  =  1,  fa  =  •  •  •  ==  /3fc=0  in  (3.85).  Then  clearly  Tp(Ii)=l  for  every  p  and  we 
have  Z(„)(A)=^cp  TP(A)  (see  (4.66))  and  this  proves  the  claim.  Therefore  (4.143)  can  be 
written  alternatively  as 

yi  9pq9p'q/ 9(n),q  —  ^pp'/^p* 

9 


(4.144) 
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4.6.  Coefficients  of  Tq  in  Zp 

One  obvious  advantage  of  working  with  Tp’s  is  that  the  coefficient  matrix  is  readily  inver¬ 
tible.  From  (4.140) 

(4.145)  G_1  =  C-lG'D. 

Therefore  once  we  express  zonal  polynomials  in  terms  of  T^’s  then  it  is  easy  to  express  Tps 
(and  their  linear  combinations)  in  zonal  polynomials. 

(4.144)  was  used  to  compute  zonal  polynomials  in  Parkhurst  and  James(l974)  as 
follows,  (i)  Up's  are  expressed  in  7^,’s.  (ii)  They  are  Gram-Schmidt  orthogonalized  relative  to 
the  orthogonality  relation  (4.144)  starting  from  the  lowest  partition  (1")  upwards.  Because 
of  the  triangularity  of  3  this  clearly  results  in  zonal  polynomials. 

Some  gpq's  can  be  explicitly  obtained  using  the  fact  Zp(Ik)=\kp.  We  regard  \kp 
as  a  function  in  k.  Then  by  (3.21)  it  is  a  polynomial  in  k  of  degree  |p|  =  n.  Now  since 
tr[Ik)=k  for  any  r  we  obtain  Tp(Ifc)==A;Pl_P2  &P2~P3- •  -  =kPl  =  kh W.  Therefore  putting 
Ik  in  (4.135)  we  obtain 

(4-146)  Xfcp  =  £  gpqkhM. 

This  uniquely  determines  gpq  for  q  =  (n),  q  =  (n  -  1,  l),qr  =  (ln)  because  these  are  the 
only  partitions  in  Pn  with  h{q)  —  n,n  —  1,1  respectively.  Now  the  leading  coefficient  of  \kp 
is  1,  hence 

(4.147)  gpM  =  1. 

(4.147)  was  originally  used  by  James  to  determine  the  normalization  Zp.  Now  let  us  look 
at  the  coefficient  of  A;”-1  in  \kp.  It  is 

*(p) 

^{(-z  +  1)  +  (-*'  +  3)  +  •••  +  (-*  +  1  +  2 Pi  -  2)} 

»'=i 

<(p)  x 

—  oP*{(— *  +  1)  +  (— *  +  1  +  2 pi  —  2)} 

«=i 
t(p) 

=  X)  p*'(pi  “  0 

t=i 

=  ai(p). 


(4.148) 
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ai(p)  already  appeared  in  (4.121).  Therefore 


(4.149) 


ffp,(n-l,l)  =  al(p)- 


This  is  mentioned  in  the  introductory  part  of  Parkhurst  and  James(1964)  in  a  somewhat 


mysterious  form 


(4.150) 


*(p)  ,  Mp) 

(n-1,1)  =  Z)  “  X)  “  o  £  ~  !)» 


where  p'  =  (pj, . . .  »Ph(p))  is  the  conjugate  partition  of  p.  (4.149)  and  (4.150)  seem  to  agree. 


Now  the  coefficient  of  k  in  \kP  is 


*(p)  (  'A 

(4.151)  {J.  4...  (Jw-JM-lXl)- ••(>»- *)>-  =  *-‘(n  - 1)' n  ( — 2“)  • 


Hence 


(4.152) 


t(p)  /  •  i  \ 

9p,(  i») = 2n-i(pi  - 1)!  n  ( — 

i  =  2  v  'r 


This  does  not  seem  to  have  been  noticed. 


Now  from  (4.138) 


(4.153) 


B=GF,  B  =((„),  F=(fpq). 


Fis  lower  triangular.  Therefore  the  last  column  of  5  is  /(i»)(i")  times  the  last  column  of 
G.  By  (2.33) 


(4.154) 


T(i»)  =  tn  =  (— l)n  - ). 


Hence  /( i»)(i")  =  (—1)"  X«.  Therefore 


(4.155)  £P,( i»)  =  /(i»)(i»)flfP,(i»)  —  (~2)n  Mpi  -  1)! 


t=2  v  'Pi 


Making  use  of  (4.145)  gives  another  set  of  identities. 


(4.156) 


T  =  G-1Z  =  C~lG,DZ. 
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4.7.  Variations  of  the  integral  representation  of  zonal  polynomials 


Now  DZ  =  (d(„)2(n),...,rf(1»)Z(in))'  and  dpZp  is  denoted  by  Cp  (3.89).  Therefore 
(4-157)  r=c-1G'(c(n),...,c(1n)y. 

Comparing  the  second  element  we  obtain 


(4.158) 


^(n-M)  =  t 


n— 2 


h 


1 

C(n-l,l) 


ai{p)cP 

pe/>« 


_1 _ 

n(n  —  1) 


H  ai(p)Cp, 
p£A 


where  C(„_lilj  =  n(n— 1)  is  given  by  (3.84).  (4.158)  was  given  by  Sugiura  and  Fujikoshi(1959) 
by  a  different  method.  They  derive  more  identities  of  this  kind.  Now  looking  at  the  last 
element  we  obtain 


(4.159) 


7(i»)  =  tn  =  M 


(n) 


-  E  e- •(»-»>' 2  (-41) 

11  >  pePn  »==  2  v  'Pi 

V  ’  pePn  i=  2  V  'Pi 


What  are  advantages  and  disadvantages  of  working  with  7^,’s?  One  advantage  is 
that  we  do  not  have  to  compute  characteristic  roots  of  A  to  compute  TP(A).  (One  only 
needs  traces  of  powers  of  A.)  Another  advantage  is  the  orthogonality  discussed  above.  A 
serious  drawback  of  Tp  is  that  we  have  to  compute  TP(A)  for  all  p  G  Pn  even  if  the  rank  of  A 
is  small.  In  usual  statistical  computations  rank  A  is  fixed  and  not  too  big.  It  is  a  covariance 
matrix  for  example.  Since  the  number  of  partitions  grows  very  fast  as  n  increases  if  one 
wants  to  compute  Zp(A)  for  |p|  big  it  seems  better  to  use  Uq’s  or  M,’s.  The  growth  of  the 
number  of  partitions  p  with  i(p)  <  k  (k  :  fixed)  is  much  smaller  than  the  growth  of  the 
number  of  all  partitions.  See  Table  4.1  in  David,  Kendall,  and  Barton(1966). 


§4.7  Variations  of  the  integral  representation  of  zonal  polynomials 

In  this  section  we  explore  various  variations  of  the  integral  representation  (Theorem 
3.7)  discussed  in  Section  3.3.  We  first  replace  U  by  the  k  X  k  uniform  orthogonal  matrix 
H. 
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Theorem  4.5.  (James,  1973)  For  k  X  k  symmetric  A 


yP(A) 


(4.160) 


yP(h) 


i=  1 


where  p  =  (pi, . . . ,  pt)  (E  Pn  and  the  k  X  k  orthogonal  H  is  uniformly  distributed. 

Proof:  As  in  Lemma  3.2  it  is  easy  to  check  that  £//{A ^1_P2  •  •  •  AP/}  €  Vntk{A).  Therefore 
we  can  write 


(4.161) 


£hW-p°- 


q€P„A  9)<k 


Replacing  A  by  UAU'  where  k  X  k  U  is  as  in  Theorem  3.7  and  taking  expectation  with 
respect  to  U  we  obtain 


(4.162) 


Zp[A)  —  ^  ^  Oq\kqZq[A). 

q€Pn.Aq)<k 


This  being  true  for  any  symmetric  k  X  k  A  we  conclude  from  Theorem  4.1 


aq  —  0,  q  p,  ap  =  = 


1 


*p  \kp  zp(iky 

Since  (4.160)  is  independent  of  normalization  we  can  have  yp  instead  of  Zp  in  (4.160).  | 

Corollary  4.4.  (Kates)  Let  X :  k  X  k  have  an  orthogonally  biinvariant  distribution 
then 


*  fAp.-p2  yp(A)£x{yp(XX)} 

£x{*1  "  WW*  • 


(4.163) 

where  A»  =  XAXf(\, . . . ,  *). 

Proof :  We  replace  X  by  H1XH2  where  H\  and  H%  are  independently  uniformly  dis¬ 
tributed.  The  distribution  of  X  is  unchanged.  Now  taking  expectation  with  respect  to  Hi 
(Theorem  4.5)  and  H2  (Theorem  3.3)  successively  we  obtain 

l 

WAP-A?}  =  £Hux,HAjl[H1XH2AH,2XHl1(l,...,i )]»-"+>} 


(4.164) 


»=i 


=  £x,HAyP(XH2AH2Xf)}/yp(Ik) 
=  yp{A)£x{yp{xfX)}/{yp(ik)}2, 


1 
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Remark  4.7.  As  in  Remark  3.11  X  can  be  rectangular.  If  X  is  m  X  k,  then  {]/p(Jjfc)}2 
on  the  right  hand  side  of  (4. 164)  m  replaced  by  yp{Ik)  yp[Im)- 

An  easy  modification  of  the  above  formulas  produces  another  set  of  identities. 

Theorem  4.6.  Let  Ui,U2  be  k  X  A;  matrices  whose  entries  are  independent  standard 
normal  variables.  Then  for  k  X  k  A. 

t 

(4.165)  1  bpZp(AA')  = 

t=i 

where  ybp  is  given  by  (4.33). 


Proof:  Let  the  singular  value  decomposition  of  A  be  A—FiDr2  where  Ti,  P2  are  or¬ 
thogonal, Z^diag^,  . . .  ,£fc)  and  6 2,  ...,  6\  are  the  characteristic  roots  of  AA!.  Since 
the  order  of  6 1,  ...,  6 k  and  the  sign  of  each  5,  can  be  arbitrary  in  the  singular  value 

decomposition  we  see  that  (4.165)  is  a  homogeneous  symmetric  polynomial  in  . Sl 

Furthermore  (4.165)  does  not  change  when  A,XJ\,U2  are  augmented  as  in  the  proof  of 
Theorem  3.7.  Therefore  to  prove  that  (4.165)  is  a  zonal  polynomial  it  suffices  to  check 
whether  it  satisfies  the  condition  of  Theorem  3.1. 

Now  replace  A  by  U3 A  where  U3  is  independent  of  U\,U2.  Then 


(4.166) 


£vuUi,U3  {  n  [Ui u3 AU2{\, . . . ,  } 

i=i 

t 

=  £ui,U2,U3  { II  [UsUiAUtll, . . . ,  i)]*w-*w+i } 

«=i 

t 

=  £uuu3,t,h{  n  [THUXAU2{1, . .  .,i)]2P<-2pi+l} 

»=i 

t 

= [^1^2(1.  •  •  • ,  *)]2p<"2p<+i}, 

»=i 


whore  T,H  are  as  in  Lemma  3.7.  Now  to  obtain  the  normalizing  constant  we  put  A=Ik- 
Then 
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u 


(4.167) 


i&pXfcp  =  SuuuAjl  PW*(1.  2p<+1} 

»=i 

»=i 

=  \kpSu. { Ru2(1,  ,•)*«-*«+! >. 


t'=l 


Hence  j6p  =  £c/{ri{=i  t/(l»  •  •  • >'  t)2p<~ 2pi+1 }.  Now  consider  (3.70)  and  (3.76).  Then  we  see 
that  the  coefficient  of  •  •  apt  is  given  just  by  £t/{ FIi=i  E/(l>  •  • . ,  t)2pi-2p<+l }.  Therefore 
it  is  the  leading  coefficient  of  Zp  and  is  given  by  (4.33). 


Corollary  4.5. 


(4.168) 


1  Pz  P(/fc)  ^  ==  £huU2 


»=i 


(4.169) 


'twiP = . i)]2--2-*'  >. 


t=i 


Proof:  (4.168)  and  (4.169)  can  be  proved  successively  as  in  the  proof  of  Theorem  4.5. 


Corollary  4.6.  Let  X i,-X"2  be  independent  and  have  orthogonally  biinvariant  distribu¬ 

tions.  Then 


(4.170) 


7  f  4  a*  i b^iZpix^ysxAZp&xtt} 

pl  J  {zP{h)Y 

t 

=  £xltx2 {  n  l-*i  AXa(l, . . . ,  *)]2p<-2p<+l }. 
1=1 


Proof :  Replace  Xi  by  HiXiH^  and  X^  by  H4X2H2.  Then  taking  expectation  with 
respect  to  H$,H4  successively  we  obtain  (4.170).  | 

Remark  4,8.  Generalization  to  rectangular  matrices  is  straightforward. 


Chapter  5 

Complex  zonal  polynomials 


In  this  chapter  we  study  complex  zonal  polynomials,  i.e.  zonal  polynomials  as¬ 
sociated  with  the  complex  normal  and  the  complex  Wishart  distributions.  The  complex 
multivariate  normal  distribution  is  used  in  the  frequency  analysis  of  multiple  time  series 
and  complex  zonal  polynomials  are  useful  for  noncentral  distributions  arising  in  this  set¬ 
ting.  Other  than  that  the  practical  applicability  of  complex  zonal  polynomials  seems  rather 
limited.  Actually  our  main  reason  of  studying  them  is  that  they  are  simpler  than  real  zonal 
polynomials.  If  one  compares  Farrcll(1980)  and  Chapter  1  of  Macdonald(1979)  it  becomes 
apparent  that  complex  zonal  polynomials  are  the  same  as  homogeneous  symmetric  poly¬ 
nomials  called  the  Schur  functions  and  the  latter  have  been  extensively  studied.  We  will 
make  this  connection  clear.  Hopefully  developing  complex  zonal  polynomials  gives  further 
insights  into  the  real  case. 

The  theory  of  the  complex  normal  and  the  Wishart  distributions  very  closely, 
parallels  that  of  the  real  case  (see  Goodman(1963)  or  Brillinger(1975))  and  it  turns  out  that 
our  development  of  Chapter  3  and  Chapter  4  can  be  directly  translated  into  the  complex 
case.  It  seems  customary  to  put  a  ~  to  denote  corresponding  objects  in  the  complex  case. 
For  example  we  use  Zp,  Cp,  ]Jp,  etc.  With  this  convention  the  translation  of  the  results  in 
Chapter  3  and  4  are  almost  immediate. 

§5.1  The  complex  normal  and  the  complex  Wishart  distributions. 


We  give  a  brief  summary  of  the  complex  normal  and  the  complex  Wishart  dis- 
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tributions.  Let  x,y  be  independently  distributed  according  to  >/(0,l/2)  and  let  z  —  x  +  iy. 
z  is  said  to  have  the  standard  complex  normal  distribution.  Or  we  say  that  z  is  a  standard 
complex  normal  (random)  variable.  Now  let  A  be  an  n  X  n  matrix  with  complex  elements 
and  let 

(5.1)  u  =  (‘Ui  j  •  •  •  j  Wfc)  =  A  (z\, . . .  t  £fc)  i 


where  z\,...,Zk  are  independent  standard  complex  normal  variables.  This  scheme  generates 
a  family  of  distributions  called  the  multivariate  complex  normal  distribution.  Its  density 
(with  respect  to  Hi  d(5Rut)  ni^('*'u»)  )  is  given  by 


(5.2)  f{u)  =  exp(-u*.£  lu) 

where  *  means  conjugate  transpose  and  E  =  £uu*  =  AA*.  If  u  has  the  density  (5.2)  we 
denote  this  by  u~  CM{0,  E).  Now  suppose  that  ttj, . . . ,  un  are  independently  distributed 
according  to  CM[0,  2?).  Let  W=  ]T)”=1  u*ui  •  The  distribution  of  W"is  called  the  complex 
Wishart  distribution  and  its  density  (with  respect  to  n<=i^*t  II,<j d(8?uty)d(Q£t>ty))  is 
given  by 


(5.3) 


/M  = 


|«j|n  fcexp(— trS  1u>) 
ttpIp-1)/2  nf=i  r(n  -  i  +  1)|X?|» 


This  distribution  is  denoted  by  CW(E,n). 

«  M 

Let  W  =  TT  be  the  (unique)  triangular  decompsition  of  a  positive  definite 
Hermitian  matrix  where  T—fJij)  is  a  lower  triangular  matrix  with  positive  diagonal  ele¬ 
ments.  Analogous  to  Lemma  3.3  we  have  the  following  lemma. 


Lemma  5.1.  Let  W  be  distributed  according  to  CW(Ik,  t').  Let  W  =  TT  .  Then  tij, 
i  >  j ,  are  independently  distributed.  \pi ta  ~  x(2(i/  —  i  +  1))  and  tij,  i  >  j,  are  standard 
complex  normal  variables. 


Proof:  See  Goodman(1963),formula(1.8)  I 
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_2 

Remark  5.1.  /ias  f/ie  gamma  density  f[x)  =  (l/r(i/  —  i  +  l))®*'-1’®-*. 

With  this  lemma  we  are  ready  to  translate  the  results  of  Chapter  3  and  4. 


§5.2  Derivation  and  properties  of  complex  zonal  polynomials 

For  ease  of  comparison  of  the  results  here  and  the  results  of  Chapter  3  and  4  we  will 
consistently  put  ~  on  corresponding  objects  of  the  complex  case.  This  sometimes  results 
in  somewhat  unnatural  notation,  for  example  if  H  is  orthogonal  then  H  is  unitary  etc.  So 
much  for  the  notation  and  now  let  us  follow  the  development  of  real  zonal  polynomials  step 
by  step  for  a  while.  All  proofs  will  be  omitted  since  they  are  the  same  for  the  real  and  the 
complex  cases. 

We  consider  the  following  transformation. 

(5-4)  mp)(a)  =  ew{up{AW)}, 

where  A  is  Hermitian  and  W  —  CW{Ik,  t>). 

Lemma  5.2.  (corresponding  to  Lemma  3.2)  If  A  is  Hermitian,  then 
(5-5)  Mp)(A)  e  Vn(A). 


Corollary  5.1.  (Corollary  3.1) 


(5.6) 

where  A  is  Hermitian. 


(A/^p)(A)  —  \upUp{A)  +  apqUq(A), 

9<P 


Corollary  5.2. 


(5.7) 


(Corollary  3.2) 


e(p) 

=  TT 


«=i 


r(pi  +  i/  +  i  -  i) 

T(v  +  1  -  »') 


<(p) 

=  JJ  [u  +  1  -  t)p. 

*=i 

—  v(v  +  1)*  •  •  [v  +  Pi  —  1) 

{u  -  1>. .  •  [y  -  1  +  p2  -  1) 


•{v  —  —  1  +  pi) 
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where  l  =  £(p)  and  ( a)k  —  a[a  +  1)-  •  •  (a  +  k  —  1). 

Corollary  5.1  shows  that 

(5.8)  t„(U)  =  TVU , 

where  T„  is  an  upper  triangular  matrix  with  diagonal  elements  tpp  =  X„p. 

Lemma  5.3.  (Lemma  3.4)  There  exists  an  upper  triangular  matrix  3  such  that 

(5.9)  BT„  —  AUB  for  all  v. 

where  Au  =  diag(XI/p,  p  6  Pn).  S  is  uniquely  determined  up  to  a  (possibly  different) 
multiplicative  constant  for  each  row. 

Using  this  B  we  define  complex  zonal  polynomials. 

Definition  5.1  Complex  zonal  polynomials 

Let  S  be  as  in  Lemma  5.3.  Complex  zonal  polynomials  l/p,  p  £  Pn  are  defined 
by 

f  £(«) 

P(n— 1,1) 

(5.10)  y  = 

Lemma  5.3  is  a  consequence  of  the  fact  that  there  exists  Uq  for  which  X„op,  p  6  Pn 
are  all  different  and  the  following  lemma. 

Lemma  5.4.  (Lemma  3.5) 

(5.11)  TVTP  =  TPTV. 

We  summarize  these  results  in  the  following  theorem. 
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Theorem  5.1.  (Theorem  3.1)  Let  |/p  be  a  complex  zonal  polynomial  then 

(5.12)  £^yp{AW)  =  KPyp{A), 

where  W  ~  CW(Ik,  v),  A  is  Hermitian,  and  \up  is  given  by  (5.7).  Conversely  (5.12)  (for 
all  sufficiently  large  v)  implies  that  ]jp  is  a  complex  zonal  polynomial. 

Now  we  explore  various  integral  identities  satisfied  by  complex  zonal  polynomials. 

Definition  5.2  A  random  unitary  matrix  His  said  to  have  the  Haar  invariant  distribution 
or  the  uniform  distribution  if  the  distribution  of  HJT  is  the  same  for  every  unitary  P. 

More  formally 

Definition  5.2'  A  probability  measure  P  on  the  Borel  field  of  unitary  matrices  is  Haar 
invariant  if 

(5.13)  P(A)  =  P(AJ), 

for  every  unitary  P  and  every  Borel  set  A. 

Existence  and  uniqueness  of  the  Haar  invariant  distribution  are  established  by  the 
following  two  lemmas. 

Lemma  5.5.  (Lemma  3.6)  Let  two  probability  measures  P\ ,  P2  satisfy  (5.13).  Then 
Pi  (A)  —  P2(A)  for  every  Borel  set  A.  Furthermore  Pi  (A)  =  Pi  (A*)  where  A*  =  {H  | 
He  A}. 

Lemma  5.6.  (Lemma  3.7)  Let  U  —  (utJ)  be  a  k  X  k  matrix  such  that  Uij  are  inde¬ 
pendent  standard  complex  normal  variables.  Then  with  probability  1  U  can  be  uniquely 
expressed  as 

(5.14)  U  =  TH, 

where  T  =  (<tJ)  is  lower  triangular  with  positive  diagonal  elements  and  H  is  unitary. 
Furthermore  (i)  T,H  are  independent,  (ii)  H  is  uniform, (Hi)  tij  are  all  independent  and 

\/2  ta  ~  x(2(fc  - 1  +  1))  ti};  i  >  j,  ~  CA/(0, 1).  . 

Now  we  obtain  the  “splitting  property”  of  complex  zonal  polynomials. 
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Theorem  5.2.  (Theorem  3.2)  Let  A,B  be  k  X  k  Hermitian  matrices.  Then 

(5.15)  £kyp{AHBH)  =  yriA)yp{B)/X/r(It), 

where  k  X  k  unitary  H  has  the  uniform  distribution. 

Definition  5.3  A  random  Hermitian  matrix  V  is  said  to  have  a  unitarily  invariant 
distribution  if  for  every  unitary  T,  TVT  has  the  same  distribution  as  V. 

As  in  the  real  case  Theorem  5.1  generalizes  to  unitarily  invariant  distributions. 

Theorem  5.3.  (Theorem  . 3. 3)  Suppose  that  V has  a  unitarily  invariant  distribution, 

then  for  Hermitian  A 


(5-16)  £vl/MV)  =  cpX/p(A), 

where 


(5.17) 


Unitarily  invariant  distributions  are  characterized  as  follows. 


Lemm'a  5.7.  (Lemma  3.8)  Let  V  =  TI DH  where  H  is  unitary  and  D  is  diagonal. 
Let  H  and  D  be  independently  distributed  such  that  H  has  the  uniform  distribution.  (Diagonal 
elements  of  D  can  have  any  distribution.)  Then  V  has  a  unitarily  invariant  distribution. 
Conversely  all  unitarily  invariant  distributions  can  be  obtained  in  this  way. 

We  can  replace  H  in  Therem  5.2  by  U  whose  elements  are  independent  standard 
complex  normal  variables. 


Theorem  5.4.  (Theorem  3.4)  Let  U  =  [uij)  be  a  k  X  k  matrix  such  that  Uij  are 
independent  standard  complex  normal  variables.  Then  for  Hemitian  A,B 


(5.18) 


£0i)p(AUBU)  = 


Xjc. 


yPw 


As  in  the  real  case  this  leads  to  the  following  observation. 
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Theorem  5.5.  (Theorem  3.5)  bp  —  ^kp/yp{Ik)  13  «  constant  independent  of  k. 

Unitarily  biinvariant  distributions  are  defined  in  an  obvious  way. 

Definition  5.4  A  random  matrix  X  has  a  'unitarily  biinvariant  distibution  if  for  every 
unitary  Ji,r2,  the  distribution  of  TiXA  is  the  same  as  the  distribution  of  X. 

Now  Theorem  5.2  and  Theorem  5.4  generalize  as  follows. 

Theorem  5.6.  (Theorem  3.6)  If  X  has  a  unitarily  biinvariant  distribution  then  for 

Hermitian  A,B 


(5.19) 
where 

(5.20) 


ekyp[A±Bx)  =  tPyr(A)yr{B), 


■h  =  ^{yr{xx))i{yr{ik)y 


Characterization  of  unitarily  biinvariant  distributions  can  be  given  in  an  obvious 

way. 

•v; 

Lemma  5.8.  (Lemma  3.9)  Let  X  —  HiDH2  where  Hi,!!?  are  unitary  and  D  is 
diagonal.  Let  Hi,H2,D  be  independently  distributed  such  that  Hx,  H2  have  the  uniform 
distribution.  (D  can  have  any  distribution.)  Then  X  has  a  unitarily  biinvariant  distribu¬ 
tion.  Conversely  all  unitarily  biinvariant  distributions  can  be  obtained  in  this  way. 


Remark  5.2. 
matrices  as  well. 


The  notion  of  unitarily  biinvariant  distributions 


applies  to  rectangular 


Now  we  take  a  look  at  the  integral  representation  of  zonal  polynomials  in  the 
complex  case. 

Definition  5.5  A  particular  normalization  of  a  zonal  polynomial  denoted  by  Zp  is  defined 
by 

(5-21)  ~ZP(Ik )  =  \kP, 

or  bp  =  1  in  Theorem  5.5. 
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Theorem  5.7.  (Theorem  3.7)  Let  p  —  ( Pi,...,pt )•  For  k  X  k  Hermitian  A 


(5.22)  ~Zp(A)  =  £0{aT  P2Ar  P3.--APt}, 

where  A  i  =  UAU  (1, . . . ,  i)  is  the  i  X  i  upper  left  minor  of  UAU  and  U  is  a  kXk  random 
matrix  whose  entries  are  independent  standard  complex  normal  variables. 

(5.22)  implies  that  ZP(A)  is  positive  for  positive  definite  A  and  increasing  in  each 
root.  Furthermore  using  the  Gale-Ryser  theorem  (see  Remark  4.1  and  Remark  4.2)  the 
coefficients  of  Mg  in  Zp  are  nonnegative  and  they  are  positive  iff  p  >-  q. 


As  in  the  real  case  ibp  denotes  the  leading  coefficient  of  Zp, namely 
(5.23)  '  zp=lbplyp. 


Theorem  5.8. 


(5.24) 


(Theorem  4-3) 

£(p)  »' 

i^p  =  JJ  JJ  (*  —  3  +  1  +  Pj  ~  Pi)pi-pi  +  i 

t'=  1  3  =  1 

UicjiPi-Pj -i  +  j)' 


Other  than  mentioning  Theorem  4.3  we  will  not  follow  the  development  of  Chapter 
4.  Of  course  all  the  results  of  Chpater  4  can  be  translated  into  the  complex  case  as 
has  been  done  so  far.  However, it  is  pointless  to  go  into  numerical  aspects  of  complex 
zonal  polynomials  because,  as  mentioned  above,  complex  zonal  polynomials  are  the  Schur 
functions  and  the  Schur  functions  are  already  well  studied.  Although  the  translation  of  the 
results  in  Chapter  4  presents  an  alternative  “probabilistic”  derivation  of  properties  of  Schur 
functions,  it  is  hardly  more  advantageous  than  a  well  developed  standard  approach  to  the 
subject.  We  will  present  the  approach  in  Macdonald(1979)  in  the  next  section.  The  link 
between  our  approach  so  far  and  the  one  in  Macdonald(1979)  is  given  by  Saw’s  generating 
function. 

Saw’s  generating  function  in  the  complex  case  was  introduced  by  Farrell(1980).  Let 
Uij  be  a  standard  complex  normal  variable.  Then  2|w,y  |2  =  2 Uiju'j  ~  x2(2)  (i.e.  |fzty|2  has 
the  standard  exponential  distribution).  Therefore  by  considering  £^{exp(0  tr  AUBU  )} 
where  A—  diag(aj, . . . ,  afc),  JB  =  diag(/3j, . . .  ,/?fc),  and  U  is  composed  of  independent 
standard  complex  normal  variables,  we  obtain 
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Therem  5.9.  (Theorem  8.8) 

(5.25)  IJt1  -  9“.Ar‘  =  E E 

*,J  n=0  pe^„ 

where  dp  is  determined  by 

(5-26)  (trA)n  =  £  dpZp(A). 

Coefficients  of  Zp  can  be  obtained  as  in  the  real  case,  namely  (i)  compare  the  coefficients 
of  0n  in  both  sides  of  (5.25),(ii)  express  the  left  hand  side  as  a  quadratic  form  in  Afp  or  Up, 
(iii)  do  the  triangular  decomposition  to  the  resulting  positive  definite  symmetric  matrix  of 
coefficients.  Now  it  will  be  shown  in  the  next  section  that  Schur  functions  Sp  satisfy  the 
same  generating  function(5.25)  and  $p  is  a  linear  combination  of  lower  order  M,’s  ( Sp  = 
52,<p  aPqMg).  Therefore  Schur  functions  agree  with  the  complex  zonal  polynomials  by  the 
uniqueness  of  the  triangular  decomposition  of  a  positive  definite  symmetric  matrix. 


§5.3  Schur  functions  and  their  determinantal  expressions. 

In  this  section  we  present  the  development  in  Chapter  1  of  Macdonald(1979) 
leading  to  Saw’s  generating  function.  (It  is  in  Section  1.4,  page  33  in  Macdonald’s  book.) 
This  part  of  the  book  is  entirely  elementary  and  forms  an  excellent  introduction  into  the 
theory  of  symmetric  functions.  In  addition  to  several  definitions  we  need  only  two  lemmas 
to  derive  Saw’s  generating  function.  In  order  to  be  self-contained  we  give  (detailed)  proofs 
of  them  following  Macdonald(1979). 

Let  p  —  €  P n>  The  Schur  function  Sp  (*i, . . . , x*)  ( k  >  i)  is 

defined  by 

$p(a;i,  •  ••,**)  =  det(z£,+  )i <i,j<k  /  det(xy  *)i<»,y<fc 


x\'+k~l  ... 

*r+fc-2  ... 

Pi+k-1 

xk 

P2+k-2 

Xk 

x\~x  ... 

€ 

xt1 

. 

m 

* 

• 

P 

xx 

Xk 

•‘'l  •  •  • 

xVk 

1 

1 

(5.27) 
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This  is  formula  (35)  in  James(1964).  If  k  <  t  we  define  $p(xi, . . . ,  =  0.  Note  that  the 

denominator  is  the  Vandermonde  determinant 

(5.28)  det(xy~*)  =  JJ  ( Xj  —  Xj). 

i<} 

Clearly  the  numerator  has  (x,  —  xy)  as  a  factor  because  if  xt-  =  Xj  then  det(xy<+fc~’)  —  0. 
Running  (i,j)  over  all  pairs  we  see  that  the  numerator  has  the  Vandermonde  determinant 
as  a  factor.  Furthermore  if  x»  and  Xj  are  interchanged  then  both  the  numerator  and  the 
denominator  change  the  sign  and  the  ratio  remains  the  same.  Therefore  Sp  (xi,...,Xfc)  is 
a  symmetric  polynomial  in  Xj’s.  It  is  easy  to  see  that  it  is  homogeneous  of  degree  |p|.  Now 
we  want  to  show  that 


(5.29)  Sp(x1,...,xfc,0)  =  Sp(x1,...,xfc). 

The  last  column  of  (x?*+fc+1_l)i<il3<fc+i  is 


(x 


Pi+fc  ~Pfc+ 1  p*+i\/ 

fc+l  >  •  •  •  >  xk+l  >  xk+ 1  )  ' 


If  Xfc+i  =  0  it  reduces  to  (0,. . .  ,0,1)'.  (Note  that  pjt+i  =  0  by  definition.)  Hence  if  x*;+x  = 
0  then  det(xy<+fc+1_t)  =  det(xy‘+fc_”1),  the  right  hand  side  being  the  k  X  k  principal  minor 
of  the  matrix  on  the  left  hand  side.  Similarly  det(xy+1~l)  =  det(x^_l).  Therefore  we  have 

(5.29)  aind  in  general  by  induction 

(5.30)  5p(xi,...,xfc,0...,0)  =  Sp(xi,...,xfc). 


This  shows  that  Sp  €  V„.  Now  let  us  look  at  the  highest  monomial  in  Sp  of  the  form 
axil"‘xfck  ((9i)  •  •  •  >Qk)  €  Pn)-  In  det(xyi+fc_l)  and  det(xy_l)  the  similar  terms  are 
obtained  by  the  products  of  the  diagonal  elements.  They  are 


Pi+fc — l  — 2  pk  k — 1 

XV  x2  •••xk>  X1 


respectively.  From  Sp(xi, . . . ,  x*)  det(xy  *)  =  det(xy<+K  *)  we  obtain 


Pi+ k-i\ 


iaxV ■  ‘ -xqk){x\  **  *  ’^fe-i)  ==  x 


..pi+fc-i. 


•  • X 


Pk 

k 


Therefore  a  —  1  and  q  =  (qi,---,qk)  =  (pi>  •  •  •  >Pk)  =  P ■  We  summarize  these  results  in 
a  lemma. 
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Lemma  5.9. 


(5-31)  Sp  =  Mp  +  aPlMg, 

9<P 

and  {  Sp,  p  €  Pn  }  forms  a  basis  of  Vn. 

Now  we  introduce  the  r-th  complete  symmetric  function  hT  which  is  the  coefficient 

of  sr  in 

k 

(5-32)  H{s)  =  JJ(1  -  sxi)~l. 

i=i 

For  convenience  we  let  h0  =  1,  hr  =  0  for  r  <  0.  Similarly  u0  =  l,ur  =  0  for  r  <  0. 
From  (2.28)  and  (5.32)  we  have 

(5-33)  H(s)U(-s)  =  1, 

or  equivalently 

n 

(5-34)  y](— l)rttrfen_r  ==  0,  to  =  1, 2, .... 

r=0 

Recursively  solving  (5.34)  we  can  express  ur’s  (and  their  products)  in  terms  of  hr' s  and  vice 
versa.  Defining 

(5-35)  Mp  =  hPlhP3- - -hpt,  p€Pn, 

we  see  that  {  Mp,p  €  Pn  }  forms  still  another  basis  of  Vn.  We  also  note  that  the  relation 
between  ur  s  and  hr  s  corresponds  to  the  relation  between  coefficients  of  autoregressive  and 
moving  average  representations  in  time  series. 

Lemma  5.10.  ((3.4)  in  Macdonald, (36)  in  James(1964)) 

Sp  =  det(hpi_i_|_y)j<t-jy<;fc 


hp  i 

hpi+i 

hpi+k—i 

(5.36) 

hpt-i 

hP2 

hP3+k- 2 

hPk-k+l 

hpk—k+2 

hpk 
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where  k  >  £{p). 

Proof:  We  work  with  k  variables  xi, . . .  ,Xk.  Let  denote  the  elementary  symmetric 
function  of  x\, . . . ,  xy_j,  scy+i, ...  ,Xk  (omitting  x3)  and  let  M  be  the  k  X  k  matrix 

(5.37)  M=((-l)fc-’4'2t.). 


For  p  6  Pn,£{p)  <  k  let  k  X  k  matrices  Ap,  Hp  be  defined  by 
(5-38)  Ap  =  (**),  Hp  =  (APi_i+J). 


Now 


fc-i 


U^\s)  =  52  u$»V  =  JJ(i  +  sxt). 


r— 1 


Therefore 


//(s)f/^(s)  =  (1  -  sxy)-1 

OO 

=  E  *'*;• 


r=0 


Equating  the  coefficient  of  spi  we  obtain 

k 


r= 1 


Therefore 


(5.39)  Hp  M  =  Ap. 

Taking  the  determinants  we  have 

(5.40)  |iTp||M]  =  |AP|  for  any  p  £  Pn. 

For  p  =  S  =  {k— 1,  k— 2, . . .,  1,0), Hp  is  an  upper  triangular  matrix  with  diagonal  elements 
l=^o-  Therefore  |i?g|=l  and 

|M)  =  jA$|  =  det(xy-*), 

which  is  the  Vandermonde  determinant.  Therefore 


(5.41) 


|HP||A,|  =  \A,\. 


5.3.  Schur  functions  and  their  determinantal  expressions. 


Now  replacing  p  by  p  +  6  we  obtain 


\AP+S\  =  det(x?+k~% 

which  is  the  numerator  of  (5.27).  Furthermore 

l-®p+$l  ==  det(fepi^.fc_  t-_  fc-j-y) 

=  det(/ip<_,+J). 

Hence 

\HP+s\  =  \Ap+s\/\As\ 

=  det(x^+A:-t')/det(*J-<) 
=  Sp. 


I 


Lemma  5.11.  ( (4-3)  in  Macdonald) 


(5.42) 


fc  CO 

n<i  -*..%)-■  =  E*”  E  w* **)  Vs..- 

*»J  n=0  peP„ 


Proof :  Replacing  xt-  by  0zt  we  can  assume  0  =  1  without  loss  of  generality.  Now  from 
(5.32) 


fit1 -*••%■)  1  =  f[  H(y,) 


y— t 

=  11(2  hr[*)yTj) 

j  r=0 

=  S  h«x(*Y Mvl1- 

= EW' «.(»). 

p 


(5.43) 
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where  (aj, . . . ,  a^)  run  over  all  fc-tuples  of  nonnegative  integers  and  p  runs  over  all  partition's 
(of  length  <  k).  Denoting  permutations  by  n 


(5.44) 


|A6(x)||A6(y)|  JJ(1  -  Xiyj)  1 

k 


=  |Ai(»)|QT]  sign(ir)  JJ  y-  **)  Y  hai- •  -h^y^1  •  •  -y%k 

*  *— 1  (a, . 


7T 

*  — 1  (ai,...,a*)eNfc 

=  \M*)\ 

E 

Y  ™9n(*)hai  •  •  -hakyr +k~*'  •  • 

(«i» 

...,afc)eN‘ 

7 r 

=  |A«(x)| 

E 

k  k 

cc  sign{< ?r)  n  hi>i-k+«i)  n  y* 

...fc*)eN* 

7T  t=l  »=1 

=  |Afi(x)| 

E 

k 

det (hbi„k+j)  JJ  yi*. 

(*ii- 

...i-OeN* 

»=i 

Now  note  that  if  5,  =  6,->  then  det(hj,._fc+J)=0.  Hence  we  can  assume  that  6,’s  are  all 
distinct  in  the  summation.  Suppose  bVl  >  b„3  >  •  •  •  ,  then 


det(/i6._fc+J)  =  det(h6,._fc+y)sz<7n(7r). 
Hence  using  (5.41)  we  obtain 


|As(cc)l  Y  det (h^-k+i)  f[  V? 


»=i 


(5.45) 


=  |A«(*)|  Y  det (Abi-fc+y)  Y  si9n(n)  JJ  y^ 

bl>--->bie>0  ir  t=l 

=  2Z  |A6,,...,6fc(»)||A6lj„.jfcfc(y)| 


bi>  —  >bk 


Now  the  set  of  all  fc-tuples  (6i,...,6fc)  such  that  6i  >  >  bk  >  0  agrees  with  the  set 

{  p  +  6  |  p  :  partition,  t{p)  <  k  }.  Therefore 


|A5(x)||A5(y)|  JJ(1  -  Xiyj)  1  =  Y  |Ai„...I6fc(x)||Afcli...i6fc(y)| 

(5.46)  *'*»  bi  >—>bk 

=  Y  |Ap+fi(*)||Ap+i(j/)|. 

pAp)<k 


This  proves  the  lemma.  B 
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Comparing  (5.25)  and  (5.42)  we  have 


(5.47) 


E  (~dp/n'.)~Zp(A)~Zp(B)  =  ]T  Sp(A)Sp(B) 

pePn  zep„ 


-  E  dM)  -SplB). 

pePn 


Hence  Sp  =  i  J/p.  Furthermore 


(5.48) 


dp  ib  —  n!, 


n  =  |p|. 


This  was  mentioned  at  the  end  of  Section  4.3. 

Remark  5.3.  There  are  two  more  determinantal  expressions  stated  in  James(1964). 
One  which  involves  elementary  symmetric  functions  (fomula(37)  in  James(1964))  w  found 
in  (2.9’)  of  Macdonald.  Formula  (38)  in  James(1964)  »  not  given  in  Macdonald. 

From  the  viewpoint  of  numerical  computation  these  determinantal  expressions 
seem  to  be  all  we  have  to  know.  For  a  given  matrix  we  calculate  the  roots  or  complete 
symmetric  functions  and  evaluate  the  determinant,  which  can  be  easily  done  by  computer. 
We  do  not  have  to  know  the  coefficients  of  Mp  or  Up  etc.  to  evaluate  the  Scb.ur  function.  It 
might  be  worthwhile  to  look  for  an  analogue  of  this  for  the  real  zonal  polynomial.  Another 
possibility  is  to  express  the  real  zonal  polynomials  in  terms  of  tip's.  Still  another  possibility 
is  to  express  the  real  zonal  polynomials  in  terms  of  the  Schur  functions. 


§5.4  Relation  between  the  real  and  the  complex  zonal  polynomials. 

We  finish  this  chapter  by  discussing  some  results  which  we  were  unable  to  derive 
by  our  elementary  approach.  James(1964)  gives  the  following  formula  relating  the  complex 
and  the  real  zonal  polynomials: 

(5'49)  = 

where  th  ekxk  H  has  the  uniform  distribution  of  orthogonal  matrices,  p  =  [plt . . . ,  pt)  € 
Pnr  and  2p  =  (2pi, ..., 2p/)  £  Pin-  (Formula  (34)  in  James(1964).)  Furthermore  he  states 


(5.50) 


SHSp{XIt)  =  0, 
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if  one  or  more  parts  of  p  is  odd.  (Formula  (40)).  See  also  Theorem  12.11.6  and  Remark 
12.11.11  in  Farrell  (1976). 


Given  these  results  we  can  evaluate  dp  in  (3.77)  as  follows.  First  note  that  by 
replacing  H  by  U  where  U  is  composed  of  independent  standard  (real)  normal  variables  we 
obtain 


(5.51) 


£uS2p{XU)  =  £t,hS2p{XTH) 

=  £tZp(XTT'X!)I  Zp(Ik) 
=  ZP{XX). 


By  (5.26)  and  (5.48) 


(5.52) 


(JrA)2”  =  J2  ~d*  ~ZM) 

P&Pin 

—  52 

P^Pin 

P€Ptn 


Now  let  A  —  diag(a1, . . . ,  a *)  and  replace  A  by  AU.  In  this  case 


Hence 


(5.53) 


trAU  =  £  aim,-  ~  M(0, 52  )• 


»=i 


£u{trAU)2n  =  1-3-  •  -(2 n  -  1)Q2  <*?)' 

(2 »)!,  "~ 


2nnl 


( trAA')n . 


On  the  other  hand  by  (5.52)  and  (5.51) 

r— 1 


(5.54) 


tv  E(2n)!^P  ^p(AV)=  52  (2n)!  lb2p  Zp{AA'). 

PZPln  P&Pn 


Hence  comparing  (5.53)  and  (5.54)  we  obtain 


(trAA')n  =  22  i b2p2nn\Zp(AA') 
pePn 


or 


(5.55) 

Now  (5.24)  gives  (3.88). 


dp  =  2nn\  ib2p. 


Appendix:  List  of  new  results 


Chapter  3: 

Proof  of  Lemma  3.2. 
Definition  3.1. 

Lemma  3.5. 

Theorem  3.3. 

Lemma  3.8. 

Theorem  3.5. 
Theorem  3.6. 

Remark  3.11. 

Proof  of  Theorem  3.7. 

Chapter 
Theorem  4.1. 

Proof  of  Theorem  4.2. 
Proof  of  Theorem  4.3. 
Proof  of  Lemma  4.4. 
Proof  of  Lemma  4.6. 
Lemma  4.7. 

Theorem  4.4. 

Corollary  4.3. 


Lemma  4.8. 

Formula  (4.105). 

Proof  of  Lemma  4.9. 

Lemma  4.10. 

Lemma  4.11. 

Lemma  4.12. 

Formula  (4.149). 

Formula  (4.152). 

Formula  (4.155). 

Formula  (4.159). 

Theorem  4.6. 

Corollary  4.5. 

Corollary  4.6. 

Chapter  5: 

All  results  in  Section  5.2  which  correspond  to 
the  new  results  in  Chapter  3  and  4. 

Derivation  of  (5.55)  given  (5.49)  and  (5.50). 
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